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GENETIC  DIVERSITY  AND  ECOTYPIC  DIFFERENTIATION  IN 
THE  MARINE  CYANOBACTERIA  PROCHLOROCOCCUS  AND 

SYNECHOCOCCUS 

by 

Gabrielle  Rocap 

Submitted  to  the  Department  of  Biology,  Massachusetts  Institute  of  Technology  and  the 
Woods  Hole  Oceanographic  Institution,  February,  2000  in  partial  fulfillment  of  the 
requirements  for  the  degree  of  Doctor  of  Philosophy 

ABSTRACT 

Primary  production  in  the  open  ocean  is  dominated  by  the  closely  related  cyanobacterial 
genera  Prochlorococcus  and  Synechococcus.  This  thesis  explores  the  relationship 
between  physiological  and  genetic  diversity  in  cultured  isolates  of  Prochlorococcus  and 
Synechococcus  and  examines  genetic  diversity  in  natural  cyanobacterial  populations  to 
better  understand  the  ecology  of  this  globally  important  clade. 

The  analysis  of  16S  ribosomal  RNA  gene  (rDNA)  sequences  of  a  spectrum  of 
Prochlorococcus  isolates  from  the  world’s  oceans  suggested  that  Prochlorococcus  could 
be  divided  into  two  distinct  ecotypes,  designated  high  and  low  B/A  because  of  their 
differing  chi  h/a2  ratios.  Analysis  of  coexisting  Prochlorococcus  populations, 
distinguished  by  their  different  mean  chlorophyll  fluorescence  intensities,  revealed  that 
they  were  physiologically  and  genetically  distinct.  Significant  differences  were  observed 
in  their  growth  response  to  light  and  chi  b/a2  ratios  and  the  isolates  differed  by  2.5%  at 
their  16S  rDNA  sequences.  Thus  “microdiversity”  commonly  observed  in  collections  of 
16S  rDNA  sequences  amplified  directly  from  the  environment,  can  reflect  the  presence  of 
ecologically  distinct  populations. 

To  further  resolve  the  ecotypes,  the  internal  transcribed  spacer  region  (ITS)  between  the 
16  and  23S  rRNAs  was  sequenced  in  isolates  of  Prochlorococcus  and  Synechococcus. 

ITS  sequences  varied  widely  in  length  and  %GC  content.  Phylogenetic  analysis 
identified  ten  sequence  clusters,  four  of  Synechococcus  and  six  of  Prochlorococcus,  each 
of  which  likely  corresponds  to  an  ecologically  distinct  population.  The  high  B/A  ecotype 
contains  four  clusters  while  the  low  B/A  ecotype  contains  two  clusters. 

Genetic  diversity  of  natural  populations  of  Prochlorococcus  and  Synechococcus  was 
investigated  in  summer  and  winter  waters  of  the  Sargasso  Sea  by  constructing  clone 
libraries  from  PCR  amplified  ITS  sequences,  which  were  screened  by  restriction 
fragment  length  polymorphism  (RFLP).  Based  on  their  phylogenetic  relationships  the 
sequences  were  assigned  to  eight  clusters,  seven  of  which  had  been  previously  identified. 

Thesis  Advisor:  Sallie  W.  Chisholm 

Title:  Professor  of  Civil  and  Environmental  Engineering,  and  Biology,  M.I.T. 
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INTRODUCTION1 

Primary  production  in  the  open  ocean  is  dominated  by  unicellular  cyanobacteria 
Synechococcus  and  Prochlorococcus  (Waterbury,  et  al.  1979,  Chisholm,  et  al.  1988). 

The  two  genera  are  abundant  (often  105  cells  ml'1)  and  widespread  in  the  world’s  oceans 
(Olson,  et  al.  1990a,  Olson,  et  al.  1990b,  Campbell  and  Vaulot  1993,  Veldhuis  and  Kraay 
1993).  Together  they  have  been  shown  to  contribute  32  -  80%  of  the  primary  production 
in  the  oligotrophic  oceans  (Goericke  and  Welschmeyer  1993,  Li  1995,  Liu,  et  al.  1997, 
Veldhuis,  et  al.  1997). 

Prochlorococcus  and  marine  Synechococcus  are  closely  related  as  determined  by 
phylogenies  constructed  using  several  genes,  including  16S  ribosomal  RNA  (16S  rDNA), 
a  subunit  of  DNA-dependent  RNA  polymerase  ( rpoCl ),  and  the  chlorophyll  a-binding 
antenna  protein  CP47  (psbB )  (Palenik  and  Haselkom  1992,  Urbach,  et  al.  1992,  Urbach, 
et  al.  1998).  These  phylogenetic  analyses  demonstrate  that  Prochlorococcus  and  marine 
Synechococcus  form  a  single  lineage  within  the  cyanobacteria  even  though  the  two 
genera  use  completely  different  light  harvesting  systems.  Synechococcus  has 
phycobilisomes  typical  of  cyanobacteria,  with  open  ocean  Synechococcus  distinguished 
from  their  freshwater  and  coastal  counterparts  by  the  presence  of  the  phycobiliprotein 
phycoerythrin  as  their  main  accessory  pigment  (Waterbury  and  Rippka  1989).  The  major 
photosynthetic  pigments  of  Prochlorococcus  are  divinyl  chlorophyll  a  (8-desethyl,  8- 
vinyl  chlorophyll  a,  or  chi  a2),  divinyl  chlorophyll  b  (chi  b2)  (Goericke  and  Repeta  1992). 

Because  the  chapters  of  this  thesis  were  written  to  be  journal  articles  which  could  stand  alone,  there  is 
some  overlap  between  this  introduction  and  the  introductory  material  in  each  chapter. 
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This  complement  of  pigments  is  distinctly  different  from  that  of  two  other  chlorophyll  b 
containing  oxygenic  phototrophs  Prochloron  and  Prochlorothrix  (Chisholm,  et  al.  1992) 
congruent  with  their  distant  phylogenetic  relationship  (Urbach,  et  al.  1992).  Because  the 
chi -b  containing  organisms  assigned  to  the  order  Prochlorales  branch  paraphyletically 
within  the  cyanobacterial  radiation,  the  term  prochlorophyte  does  not  describe  a 
phylogenetically  coherent  group  (Pinevich,  et  al.  1997),  thus  Prochlorococcus  is  referred 
to  as  a  cyanobacterium  throughout  this  thesis. 

Marine  Synechococcus  are  larger  than  Prochlorococcus,  ranging  in  size  from  0.6-0.8  pm 
x  0.6- 1.6  pm  (Waterbury,  et  al.  1986)  while  Prochlorococcus  are  smaller  (0.4-0.6  pm  x 
0.5-0.8  pm)  (Lichtle,  et  al.  1995).  These  size  differences  have  implications  for  nutrient 
assimilation  as  the  smaller  Prochlorococcus  have  a  higher  surface-to-volume  ratio  and 
therefore  will  be  diffusion  limited  at  lower  concentrations  than  the  larger  Synechococcus 
(Chisholm  1992). 

Distribution  in  the  Oceans 

Prochlorococcus  and  Synechococcus  each  have  a  unique  flow  cytometric  signature  and 
are  easily  identified  and  enumerated  in  situ  (Chisholm,  et  al.  1988,  Olson,  et  al.  1990a). 
Using  flow  cytometry  a  large  number  of  cell  abundance  measurements  have  been 
collected  throughout  the  worlds  oceans  in  the  last  ten  years  (recently  reviewed  in 
Partensky  et.  al.  1999).  The  two  genera  have  overlapping  but  distinct  distributions. 
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Geographically,  Synechococcus  has  a  horizontally  broader  distribution,  as  it  can  be  found 
in  waters  with  low  temperatures  and  high  nutrient  concentrations  where  Prochlorococcus 
is  absent  (Partensky,  et  al.  1999a).  However,  in  temperate  oligotrophic  waters  where 
they  coexist,  Prochlorococcus  has  a  greater  vertical  range,  as  it  is  observed  in  abundance 
much  deeper  in  the  water  column  than  Synechococcus,  because  divinyl  chi  b  allows  it  to 
more  effectively  harvest  the  low  light  intensities  and  blue  wavelengths  characteristic  of 
the  deep  euphotic  zone  (Morel,  et  al.  1993,  Moore,  et  al.  1995). 

The  two  genera  also  exhibit  differences  in  seasonal  distributions  in  the  oceanic  regimes 
where  they  co-occur.  In  the  Atlantic  Ocean  Prochlorococcus  and  Synechococcus  have 
opposite  seasonal  cycles  (Olson,  et  al.  1990b,  DuRand,  et  al.  2000).  Integrated 
concentrations  of  Prochlorococcus  are  maximal  in  the  stably  stratified,  nutrient  depleted 
waters  in  late  summer  and  early  fall  when  Synechococcus  are  at  a  minimum.  Conversely 
integrated  cell  concentrations  of  Synechococcus  are  maximal  in  the  deeply  mixed  waters 
in  early  spring  when  Prochlorococcus  are  at  a  minimum.  A  more  pronounced  version  of 
this  pattern  is  observed  in  the  Gulf  of  Aqaba  off  the  Red  Sea  where  Prochlorococcus  is 
not  detectable  in  winter  months  when  mixing  is  as  deep  as  300-600m  (Lindell  and  Post 
1995).  Here,  Synechococcus  are  dominant  in  the  autumn  and  spring  at  the  onset  and  the 
break  up  of  stratified  conditions,  while  Prochlorococcus  is  numerically  dominant  in  the 
stratified  low  nutrient  summer  waters.  In  the  permanently  stratified  subtropical  Pacific  at 
the  HOT  station,  the  setting  is  similar  to  summer  conditions  at  the  two  above  sites  all 
year  round.  The  water  column  never  mixes  below  the  euphotic  zone,  nitrogen  remains 
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undetectable  in  the  surface,  and  Prochlorococcus  outnumber  Synechococcus  throughout 
the  euphotic  zone  (Campbell  and  Vaulot  1993,  Campbell,  et  al.  1997). 

The  water  columns  described  above  differ  in  a  number  of  environmental  parameters, 
including  light  availability  and  fluctuations,  temperature,  and  nutrient  and  trace  metal 
concentrations.  Any  or  all  of  these  factors  may  play  a  role  in  the  observed  distributions. 
However,  the  above  discussion  treats  Prochlorococcus  and  Synechococcus  as  two 
homogenous  groups,  when  in  fact  they  are  each  composed  of  multiple  physiological 
types  which  may  react  differently  to  each  of  the  parameters  mentioned.  It  is  probably  not 
possible  to  interpret  the  distributions  of  each  genera  without  an  understanding  of  these 
multiple  groups. 

Synechococcus 

The  name  Synechococcus  has  been  given  to  a  supra-generic  group  of  small  (<  3  pm) 
unicellular  coccoid  cyanobacteria  which  lack  sheaths  and  divide  by  binary  fission  in  a 
single  plane  (Waterbury  and  Rippka  1989).  Although  similar  morphologically,  strains  of 
the  Synechococcus  group  are  quite  diverse,  spanning  a  large  range  of  DNA  base  ratios 
(39-71  mol  %  G+C).  They  have  been  provisionally  divided  into  6  genus  level  strain 
clusters  (Cyanobacterium,  Synechococcus,  Cyanobium,  marine  A,  marine  B,  and  marine 
C)  until  further  strain  characterization  and  taxonomic  analyses  can  be  applied  (Waterbury 
and  Rippka  1989).  Recent  work  using  16S  rDNA  sequences  indicates  that  the  six 
clusters  are  paraphyletic  within  the  cyanobacterial  radiation.  However,  strains  from  the 
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marine  A,  marine  B  and  Cyanobium  clusters  form  a  monophyletic  clade  (Honda,  et  al. 
1999).  Open  ocean  Synechococcus  isolates  all  belong  to  the  marine  A  cluster,  although 
marine  A  cluster  strains  have  also  been  found  in  coastal  waters.  In  contrast,  marine  B 
and  C  isolates  are  exclusively  from  coastal  or  brackish  waters. 


The  marine  A  cluster  Synechococcus  encompass  a  physiologically  diverse  set  of  strains. 
Although  all  use  phycoerythrin  as  their  major  light  harvesting  pigment,  some  also  possess 
the  chromophore  phycourobilin  (PUB)  which  can  attach  to  phycoerythrin  in  place  of 
phycoerythrobilin  (PEB)  (Ong  and  Glazer  1991).  The  relative  amounts  of  PUB  and  PEB 
vary  among  strains  (Waterbury,  et  al.  1986).  In  addition,  some  isolates  are  capable  of  a 
novel  form  of  swimming  motility  (Waterbury,  et  al.  1985,  Waterbury,  et  al.  1986). 

Marine  A  Synechococcus  also  vary  in  the  ability  to  utilize  organic  nutrient  sources  and  in 
their  cell  cycle  behavior  (Waterbury,  et  al.  1986,  Binder  and  Chisholm  1995). 

There  is  some  evidence  that  the  distinct  physiological  types  of  marine  A  Synechococcus 
differ  in  their  distributions.  Both  low  and  high  PUB  type  Synechococcus  are  present  in 
both  coastal  and  open  ocean  waters,  while  those  lacking  PUB  are  observed  only  in 
coastal  waters  (Olson,  et  al.  1988,  Olson,  et  al.  1990a),  likely  because  of  the  spectral 
advantage  of  PUB  in  the  blue  light  dominated  euphotic  zone  of  the  open  ocean  (Wood,  et 
al.  1998).  Viruses  may  also  play  a  role  in  controlling  the  distribution  and  diversity,  as  the 
majority  of  Synechococcus  in  natural  populations  are  resistant  to  their  co-occurring 
phages  (Waterbury  and  Valois  1993). 
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Marine  A  Synechococcus  are  also  diverse  genetically,  exemplified  by  the  range  of  mol 
percent  G+C  content  of  their  genomes  (55-62%)  (Waterbury,  et  al.  1986).  Using  whole 
genome  RFLP  analysis  four  distinct  lineages  were  identified,  each  as  divergent  from  one 
another  as  from  freshwater  Synechococcus  (Douglas  and  Carr  1988,  Wood  and  Townsend 
1990).  Examination  of  strains  from  the  California  Current  using  rpoCl  sequences 
identified  two  clusters,  one  containing  high  PUB  isolates  and  the  other  consisting  of  low 
PUB  isolates  (Toledo  and  Palenik  1997).  Each  of  these  lineages  was  distinct  from  the 
typical  laboratory  model  high  and  low  PUB  strains  WH8103  and  WH7803,  suggesting 
that  pigment  content  alone  may  not  resolve  the  multiple  groups  of  marine  Synechococcus 
(Toledo  and  Palenik  1997).  Recently  rpoCl  sequences  were  used  to  determine  that 
motile  isolates  with  a  range  of  PUB  contents  are  contained  in  a  single  clade  (Toledo,  et 
al.  1999).  Finally,  analyses  using  16S  rDNA  sequences  suggested  that  the  high  PUB 
strain  WH8103  and  the  no  PUB  strain  WH7805  are  lineages  taxonomically  equivalent  to 
the  marine  A  and  B  divisions  (Urbach,  et  al.  1998). 

Prochlorococcus 

Prochlorococcus  are  small,  non-motile  and,  uniquely  among  organisms,  contain  divinyl 
chlorophylls  a  and  b  as  their  major  photosynthetic  pigments,  although  some  isolates  also 
have  mono  vinyl  chi  b  (Moore  and  Chisholm  1999).  In  addition,  some  isolates  of 
Prochlorococcus  possess  the  genes  for  phycoerythrin  ( cpeA ,  cpeB),  (Hess,  et  al.  1996, 
Ting,  et  al.  1999),  although  its  functional  role  is  not  yet  clear.  In  the  relatively  short  time 
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since  the  original  description  of  Prochlorococcus  rapid  advances  have  been  made  in 
understanding  their  ecology,  physiology  and  genetics,  which  is  the  subject  of  a  recent 
review  (Partensky,  et  al.  1999b). 

The  original  open  ocean  divinyl  chlorophyll  containing  isolate  (strain  SSI 20)  was  named 
Prochlorococcus  marinus  (Chisholm,  et  al.  1992).  As  additional  isolates  were 
characterized,  their  physiological  and  genetic  diversity  (see  below)  suggested  that  they 
might  comprise  more  than  one  species.  Until  more  formal  taxonomic  distinctions  are 
made,  a  terminology  parallel  to  Synechococcus  has  been  adopted,  with  isolates  referred  to 
as  strains  of  the  genus  Prochlorococcus. 

Cultured  isolates  of  Prochlorococcus  have  been  divided  into  two  genetically  and 
physiologically  distinct  groups,  named  based  on  their  most  striking  and  easy  to  measure 
difference:  the  ratio  of  chi  b/a2  over  a  range  of  growth  irradiances  (Moore  and  Chisholm 
1999).  High  B/A  isolates  have  higher  chi  b/a2  ratios,  chlorophyll  a2-specific  light 
harvesting  efficiencies  and  spectrally-weighted  average  absorption  coefficients,  allowing 
them  to  grow  at  extremely  low  irradiances  (less  than  5  pmol  Q  m'2  s'1)  where  low  B/A 
isolates  are  incapable  of  growth.  Conversely,  low  B/A  isolates  are  able  to  grow 
maximally  at  higher  light  levels  where  high  B/A  isolates  are  inhibited  (Moore  and 
Chisholm  1999).  Adaptation  to  light  is  not  the  only  distinguishing  feature  of  the  two 
groups.  They  also  differ  in  their  sensitivity  to  copper  toxicity,  with  low  B/A  isolates  able 
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to  grow  at  free  cupric  ion  concentrations  five  times  higher  than  those  high  B/A  isolates 
can  tolerate  (Mann  2000). 

The  different  physiologies  of  the  two  types  of  Prochlorococcus  suggest  that  they  might 
have  different  distributions  in  the  field,  with  high  B/A  Prochlorococcus  predominating  in 
low-light  waters  in  the  deep  euphotic  zone.  Low  B/A  isolates  may  dominate  in  high  light 
surface  waters  where  free  cupric  ion  concentrations  are  higher,  (at  least  in  the  summer  in 
the  Sargasso  Sea  (Moffett  1995)).  Consistent  with  this  proposed  distribution,  chi  b/a2 
ratios  in  natural  waters  have  been  observed  to  increase  with  depth,  and  the  range  is  larger 
than  observed  for  any  single  isolate  of  Prochlorococcus  (Goericke  and  Repeta  1993). 
Further  evidence  suggesting  that  ecologically  distinct  populations  occupy  different  depths 
in  the  euphotic  zone  comes  from  copper  additions  to  natural  populations  from  the 
Sargasso  Sea.  In  bottle  incubations  Prochlorococcus  taken  from  deep  samples 
disappeared  immediately  upon  the  addition  of  copper,  while  populations  from  surface 
waters  remained  stable,  consistent  with  the  greater  sensitivity  of  high  B/A  isolates  to 
copper  (Mann  2000). 

Ecotypes 

Clusters  of  strains  that  are  closely  related  genetically,  as  observed  in  Prochlorococcus 
and  Synechococcus,  are  a  common  feature  in  the  microbial  world  (Ward,  et  al.  1998). 

The  multiple  groups  of  Prochlorococcus  and  Synechococcus  are  referred  to  as  ecotypes 
because  it  has  become  clear  that  their  differing  physiologies  have  implications  for  their 
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ecological  distributions.  It  has  been  suggested  that  such  clusters  should  be  at  the  heart  of 
a  natural  species  concept  for  bacteria  (Ward  1998).  Without  the  ability  to  interbreed  as  a 
criterion  to  delineate  species,  microbiologists  must  use  other  means  to  define  a  “lineage 
evolving  separately  from  others  with  its  own  unitary  evolutionary  role  (ecological  niche)” 
(Ward  1998).  That  is,  a  bacterial  species  is  defined  as  an  ecologically  coherent 
population.  Such  populations  may  coexist  in  a  guild  (a  group  of  species  that  exploit  the 
same  class  of  environmental  resources)  but  will  respond  differently  to  environmental 
determinants. 


Evolutionary  theory  predicts  that  ecologically  distinct  populations  will  correspond  to 
sequence  similarity  clusters  (Cohan  1995,  Palys,  et  al.  1997,  Ward  1998,  Ward,  et  al. 
1998).  The  asexual  nature  of  bacterial  evolution  increases  divergence  between  species 
while  decreasing  divergence  within  species.  As  clonal  descendants  of  individuals  with 
favorable  mutations  at  functional  genes  sweep  through  a  population,  evolutionarily 
neutral  mutations  at  all  other  loci  are  carried  along  as  well,  purging  the  population  of 
diversity  at  all  loci  (Palys,  et  al.  1997).  Thus,  these  periodic  selection  events  actually 
increase  the  divergence  at  all  loci  between  ecologically  distinct  populations.  Therefore 
ecologically  neutral  mutations  in  rDNA  or  other  “housekeeping  genes”  should  be 
excellent  markers  for  distinct  ecotypes. 

Goals  of  this  thesis 

When  this  research  was  begun,  it  had  been  established  that  Prochlorococcus  were  closely 
related  to  marine  A  Synechococcus  and  not  specifically  to  the  two  other  chlorophyll  b 
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containing  microorganisms  or  to  chloroplasts  (Palenik  and  Haselkom  1992,  Urbach,  et  al. 
1992).  The  physiological  diversity  of  the  first  few  Prochlorococcus  isolates  was 
beginning  to  be  explored  (Morel,  et  al.  1993,  Partensky,  et  al.  1993,  Moore,  et  al.  1995). 
The  use  of  ribosomal  RNA  as  a  tool  to  determine  evolutionary  relationships  among 
cultured  microorganisms  was  well  established  (Woese  1987).  Studies  of  rDNA 
sequences  to  investigate  diversity  in  the  environment  without  culturing  were  beginning  to 
inform  microbial  ecology  in  both  terrestrial  and  marine  environments  (Giovannoni,  et  al. 
1990,  Ward,  et  al.  1990,  Schmidt,  et  al.  1991,  Fuhrman,  et  al.  1993). 

The  first  objective  of  this  thesis  was  to  explore  the  genetic  diversity  of  Prochlorococcus 
isolates  in  the  context  of  their  physiological  diversity.  As  the  major  physiological 
distinctions  between  the  isolates  were  discovered,  the  parallel  differences  in  16S  rDNA 
sequences  helped  establish  the  two  ecotypes  of  Prochlorococcus  (Chapters  2  &  3).  Once 
this  correlation  was  established,  the  internal  transcribed  spacer  region  (ITS)  between  the 
16S  and  the  23S  rRNAs  was  used  to  further  resolve  the  ecotypes  of  Prochlorococcus  and 
Synechococcus  (Chapter  4).  Next,  to  assess  how  representative  cultured  isolates  might  be 
of  natural  populations,  and  to  increase  the  database  of  sequences  available  for  the  design 
of  ecotype  specific  probes,  the  genetic  diversity  at  the  ITS  was  explored  in  field  samples 
(Chapter  5).  Isolates  and  environmental  sequences  have  been  assigned  to  sequence 
similarity  clusters,  each  of  which  may  correspond  to  an  ecologically  distinct  population. 
Signature  sequences  have  been  identified  for  many  of  the  clusters  (Chapter  6)  which  in 
the  future  may  be  used  as  probes  in  quantitative  hybridizations  to  determine  the  relative 
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abundances  of  the  ecotypes  in  natural  populations.  This  will  ultimately  provide  a  better 
understanding  of  the  distributions  of  these  two  important  primary  producers. 
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direct  evidence  supporting  this  hypothesis,  which  has  been 
generated  by  isolating  and  analysing  distinct  co-occurring  popu¬ 
lations  of  Prochlorococctts  at  two  locations  in  the  North  Atlantic. 
Co-isolates  from  the  same  water  sample  have  very  different  light- 
dependent  physiologies,  one  growing  maximally  at  light  inten¬ 
sities  at  which  the  other  is  completely  photoinhibited.  Despite 
this  ecotypic  differentiation,  the  co-isolates  have  97%  similarity  in 
their  16S  ribosomal  RNA  sequences,  demonstrating  that 
molecular  microdiversity,  commonly  observed  in  microbial 
systems7"12,  can  be  due  to  the  coexistence  of  closely  related, 
physiologically  distinct  populations.  The  coexistence  and  distri¬ 
bution  of  multiple  ecotypes  permits  the  survival  of  the  population 
as  a  whole  over  a  broader  range  of  environmental  conditions  than 
would  be  possible  for  a  homogeneous  population. 

Using  sea-going  flow  cytometry  for  studying  picoplankton  popu¬ 
lations,  we  and  others4,13,14  have  observed  multiple  populations 
of  Prochlorococcus  in  single  water  samples,  as  distinguished  by 
their  chlorophyll  fluorescence  intensities.  These  populations  could 
be  derived  from  the  mixing  together  of  genetically  identical 
Prochlorococcus  cells  which  have  acclimated  to  different  past  light 
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The  cyanobacterium  Prochlorococcus 3,2  is  the  dominant  oxygenic 
phototroph  in  the  tropical  and  subtropical  regions  of  the  world’s 
oceans1,3’4.  It  can  grow  at  a  range  of  depths  over  which  light 
intensities  can  vary  by  up  to  4  orders  of  magnitude.  This  broad 
depth  distribution  has  been  hypothesized  to  stem  from  the 
coexistence  of  genetically  different  populations  adapted  for 
growth  at  high-  and  low-light  intensities4"6.  Here  we  report 
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Figure  1  Properties  of  the  euphotic  zone  and  flow  cytometric  signatures  of  the 
Prochlorococcus  populations  and  isolates,  a.  b.  The  physical  features  of  the 
water,  columns  were  similar  at  the  Gulf  Stream  station.  37°  30.8'N,  68*  14.477,  and 
the  Sargasso  Sea  station,  34*  46.5'N,  66°  11.1  TV.  ©.  d,  Row  cytometry  signatures  of 
coexisting  Prochlorococcus  populations  from  135  m  in  the  Gulf  Stream  and  100  m 
in  the  Sargasso  Sea  from  which  the  isolates  were  obtained,  e-h.  Row  cytometry 
signatures  of  the  cultured  isolates  maintained  at  an  irradiance  of  9  jimo!  Q  rrf*  s“’. 
Numbers  in  parentheses  refer  to  the  mean  chlorophyll  fluorescence  per  cell  (FL) 
and  FALS  per  cell.  Differences  in  the  absolute  values  of  the  flow  cytometry 
parameters  between  natural  populations  and  isolates  result  from  unmatched 
growth  conditions. 
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and/or  nutrient  regimes,  or  they  could  represent  genetically  distinct 
populations,  adapted  for  optimal  growth  under  different  conditions 
but  co-occurring  in  environments  where  conditions  are  favourable 
for  both.  The  latter  interpretation  has  been  suggested  by  indirect 
evidence  from  field  studies413,14,  from  physiological  diversity  among 
Prochlorococcus  isolates6,  and  from  studies  of  the  molecular  phylo¬ 
genetic  diversity  of  marine  cyanobacteria,  which  have  revealed  a 
number  of  closely  related  gene  sequences7-9.  This  high  microdiver¬ 
sity  has  also  been  observed  for  marine  bacterioplankton7,6,10  and 
microorganisms  in  other  environments1  u2,  although  the  ecological 
significance  has  remained  unclear.  Differences  in  RNA  polymerase 
gene  sequences  have  been  correlated  with  pigment  content  in 
marine  Synechococcus15;  however,  molecular  microdiversity  observed 
in  the  more  widely  used  16S  rRNA  gene  has  not  been  connected 
directly  to  physiological  function  in  any  group  of  microorganisms 
from  a  particular  habitat.  The  multiple  Prochlorococcus  populations 
observed  by  flow  cytometry  provided  an  excellent  opportunity  for 
studying  these  relationships. 

In  two  locations  of  the  North  Atlantic  Ocean  at  the  junction  of 
two  subsurface  chlorophyll  fluorescence  maximum  layers,  we  used 
flow  cytometry  to  sort  the  high-  and  low-fluorescence  cells  from 
coexisting  Prochlorococcus  populations  and  then  brought  them  into 
culture  (Fig.  1).  After  several  years  of  growing  under  identical 
conditions,  the  cells  sorted  from  a  Gulf  Stream  water  sample 
(isolates  MIT9312  and  MIT9313)  and  a  Sargasso  Sea  water 
sample  (isolates  MIT9302  and  MIT9303)  have  maintained  their 
differences  in  chlorophyll  fluorescence  and  forward-angle  light 
scatter  (FALS)  per  cell  (Fig.  le-h),  indicating  that  the  phenotypic 
differences  originally  observed  in  these  coexisting  populations  were 
due  to  genetic  differences. 

We  hypothesized  that  the  high-  and  low-fluorescence 
Prochlorococcus  are  low-  and  high-light  adapted,  respectively, 
because  differences  in  flow  cytometry  parameters  are  correlated 
with  differences  in  photophysiology  between  Prochlorococcus  SS 1 20, 
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Figure  2  Growth  and  photosynthetic  rate  as  a  function  of  irradiance  for  the 
Prochlorococcus  isolates  shown  in  Fig.  1.  a.  b.  Specific  growth  rate  as  a  function 
of  growth  irradiance.  c,  d,  Photosynthetic  rate  (normalized  to  chi  a2)  as  a 
function  of  irradiance  for  the  four  isolates  when  grown  at  an  irradiance  of 
9|urQm-2s-1.  Symbols  and  error  bars  correspond  to  the  mean  ±  1  s.e.  of 
duplicate  measurements. 


isolated  from  the  Sargasso  Sea,  and  MED4,  isolated  from  the 
Mediterranean  Sea6.  To  test  this  hypothesis,  we  compared  these 
four  isolates  with  respect  to  their  light-dependent  growth  response. 
The  pattern  of  growth  rate  as  a  function  of  irradiance  differs  greatly 
between  each  coexisting  pair  (Fig.  2a,  b).  The  most  striking 
difference  is  that  the  high-fluorescence  isolates,  MIT9303  and 
MIT9313,  show  complete  photoinhibition  of  growth  at  irradiances 
where  the  low-fluorescence  isolates,  MIT9302  and  MIT9312,  are 
growing  at  or  close  to  maximal  rates  (Fig.  2a,  b).  Additionally, 
MIT9303  and  MIT9313  grow  better  at  lower  light  than  their  co¬ 
isolates  (Fig.  2a,  b). 

To  understand  better  the  physiological  underpinnings  of  the 
differential  growth  response  of  the  isolates,  we  compared  their 
pigment  content,  absorption  properties  and  photosynthetic  perfor¬ 
mance  when  grown  at  low  irradiance.  The  high-fluorescence  iso¬ 
lates,  MIT9303  and  MIT9313,  contain  higher  concentrations  of 
divinyl  chlorophylls  a  and  b  (chi  a2  and  chi  h?)  relative  to  their  low- 
fluorescence  co-isolates  MIT9302  and  MIT93 12  (Table  1).  They  also 
contain  higher  ratios  of  chi  resulting  in  higher  spectrally 
weighted  average  chlorophyll  fl2specific  absorption  coefficients,  a*w 
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Figure  3  Phylogenetic  relationships  of  Prochlorococcus  and  Synechococcus 
isolates  and  environmental  sequences  from  the  Sargasso  Sea  ('SAR')  inferred 
from  16S  rRNA  sequences.  One  of  the  three  most  probable  trees  found  in  a 
maximum  likelihood  analysis  is  presented.  (Only  the  relative  branching  orders  of 
Prochlorococcus  isolates  MED4,  TATLIa  and  NATL2  and  the  environmental 
sequence  SAR6  differ  between  the  three  trees.)  Bootstrap  proportions  are 
percentages  of  100  resampled  data  sets  from  analyses  of  maximum  likelihood 
(ML),  distance  (using  minimum  evolution  as  the  objective  function  (ME))  and 
maximum  parsimony  (MP);  values  below  50  are  not  shown.  The  freshwater 
cyanobacterium  Synechococcus  PCC  630t  was  used  to  root  the  tree.  Analyses 
with  additional  members  of  the  cyanobacterial  radiation  resulted  in  essentially 
similar  branching  orders  (data  not  shown). 
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Table  1  Physiological  characteristics  of  Pro chlorococcus  isolates  grown  at  a 
low  growth  irradiance  of  9  ^mol  Q  m^s"1 

Gulf  Stream  Sargasso  Sea 


Parameter 

MIT9312 

M1T9313 

MIT9302 

MIT9303 

Pigments 

chla? 

3.15 

5.1 

3.2 

4.2 

chi  62 

2.08 

6.0 

2.1 

5.0 

chi  b2/a2 

0.661 

1.175 

0.65 

1.19 

Absorption 

Km 

0.0152 

0.0213 

0.0147 

0.0232 

Photosynthetic 

“ctite 

0.057 

0.118 

0.055 

0.125 

1.8 

2.4 

18 

2.54 

&nax 

0.086 

0.128 

0.086 

0.125 

Chi  a2  and  chi  b2  arc  in  units  of  fg  per  cell;  chi  b2/a2  are  on  a  weight/weight  basis.  5^  is 
presented  in  units  of  (m  (mg  chla2)~').  Photosynthetic  parameters  are  in  the  following  units: 

-  fgC  (fg  chi  a2y  h'1  (|unolQnr2s-,r‘:  PS55  =  fgC  (fg  chi  azj-’h-’;  and  =  mol 
carbon  (molQ)- .  All  values  are  means  of  duplicate  cultures  with  coefficients  of  variance 


(Table  1).  Furthermore,  the  higher  chi  bja 2  ratio  of  MIT9303  and 
MIT9313  drives  a  2.1— 2.3  times  higher  chi  fl2-$pecific  photosyn¬ 
thetic  efficiency  (0^0)  and  results  in  higher  maximum  quantum 
yields  ( <f>m2X )  relative  to  MIT9302  and  MIT9312  (Fig.  2c,  d;  Table  1). 
Thus,  we  conclude  that  the  isolates  M1T9303  and  MIT93 1 3  are  low- 
light  adapted  and  MIT9302  and  MIT9312  are  high-light  adapted. 

We  next  explored  whether  the  physiology  of  these  four  isolates  is 
related  to  their  phylogeny  by  sequencing  their  16S  rRNA  genes. 
Although  they  were  isolated  from  different  waters,  the  two  high¬ 
light  adapted  isolates,  MIT9302  and  MIT9312,  have  very  high 
sequence  similarity  (99.7%),  as  do  the  two  low-light  adapted 
isolates,  MIT9303  and  MIT9313  (99.2%).  The  two  co-isolates 
from  each  region  are  more  disparate  than  the  high-  and  low-light 
adapted  pairs:  the  Sargasso  Sea  co-isolates  MIT9302  and  MIT9303 
are  only  97.3%  similar,  whereas  the  Gulf  Stream  co-isolates 
MIT9312  and  MIT9313  are  97.7%  similar.  The  divergence  between 
the  co-isolates  is  meaningful  because  their  degrees  of  similarity  are 
comparable  to  those  between  the  low-light  adapted  Prochlorococcus 
isolates  and  marine  isolates  of  phycobilisome-containing 
Synechococcus  (96.9-98%).  A  phylogenetic  tree  constructed  from 
the  coexisting  Prochlorococcus  and  other  cyanobacterial  sequences 
shows  that  the  high-light  adapted  MIT9312  and  M1T9302  cluster 
with  the  high-light  adapted  MED4  and  two  isolates  from  the  Pacific, 
SB  and  GP2,  which  have  low  ratios  of  chi  bla2  (ref.  16)  (Fig.  3).  The 
monophyly  of  this  ‘high-light  adapted  clade’17  is  well  supported  in 
all  analyses.  The  low-light  adapted  MIT9303  and  MIT9313  are  on 
separate,  more  basal  branches  of  the  tree  (Fig.  3).  Thus,  the 
phylogenetic  relationship  is  correlated  with  the  phenotypic  rela¬ 
tionship. 

On  the  basis  of  the  physiology  and  molecular  phylogeny  of 
these  co-isolates,  the  previously  characterized  isolates,  SSI 20  and 
MED46,17  19,  and  four  additional  isolates  from  the  Pacific  Ocean  (ref. 
20  and  G.R.  and  S.W.C.,  unpublished  results),  it  appears  that  at 
least  two  distinct  ecotypes  can  be  identified  for  isolates  of  the  genus 
Prochlorococcus .  This  is  analogous  to  two  types  of  marine 
Synechococcus,  which  have  different  ratios  of  phycourobilin  to 
phycoerythrobilin21,  and  which  appear  to  be  distributed  differently 
on  the  basis  of  their  relative  light  harvesting  abilities22.  It  has  been 
suggested  that  low-light  adapted,  high  chi  bja-i  Prochlorococcus 
predominate  in  the  deeper  portion  of  the  euphotic  zone  where 
nutrients  are  abundant,  and  that  high-light  adapted,  low  chi  bz/a2 
Prochlorococcus  predominate  in  the  surface  where  nutrients  are 
typically  limiting4’6.  This  distribution  of  multiple  Prochlorococcus 
ecotypes  in  the  same  water  column  would  result  in  greater  inte¬ 
grated  production  than  could  be  achieved  by  a  single  ecotype.  For 
example,  if  we  estimate  primary  production  (P  =  a M+mCcc,E)  for 
the  same  number  of  cells  (tv)  in  the  low  light  of  the  deep  euphotic 
zone  (£  =  lOpjnolQm  2s_l)  (see  Methods),  using  the  strain- 
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specific  values  of  %hlat  <f>m  and  chi  a2  per  cell  (Cc),  then  the 
contribution  of  the  low-light  adapted  MIT9313  to  primary  produc¬ 
tion  would  be  3.4  times  higher  than  that  of  its  high-light  adapted 
coisolate  MIT9312.  Thus,  the  use  of  a  single  set  of  physiological 
parameters  will  result  in  errors  in  the  estimates  of  Prochlorococcus 
primary  production  in  the  oceans. 

Our  results  show  that  high  phylogenetic  microdiversity  observed 
for  coexisting  marine  cyanobacterial  picoplankton7’9  is  due,  in  part, 
to  physiological  diversity  within  the  Prochlorococcus  population. 
Although  studies  using  allozyme  banding  patterns  have  demon¬ 
strated  ecotypic  differentiation  in  eukaryotic  phytoplankton23,24,  we 
have  gone  a  step  further  in  that  we  have  specifically  linked  physio¬ 
logical  diversity  to  small  differences  in  16S  rRNA  sequences.  Thus, 
high  microdiversity  at  the  16S  rRNA  locus  observed  in  other 
microbial  communities  examined  primarily  by  comparative 
sequence  analysis  may  also  reflect  physiologically  distinct  microbial 
populations.  The  existence  of  multiple  ecotypes  in  a  particular 
environment  may  be  a  general  phenomenon  in  the  microbial  world, 
allowing  for  survival  over  a  broader  range  of  conditions  than  could 
be  achieved  by  a  physiologically  and  genetically  homogeneous 
population.  ri 


Methods 

Flow  cytometry  and  culturing.  Field  populations  were  identified  as 
Prochlorococcus  on  the  basis  of  the  characteristic  low  chlorophyll  fluorescence 
and  FALS  signature  on  the  flow  cytometer1.  Populations  were  designated  as 
'double’  when  bimodal  signatures  were  visible,  defined  by  different  fluores¬ 
cence  and  FALS  signals  (Fig.  1).  Coexisting  Prochlorococcus  populations  were 
sorted  from  each  other  into  sterile  test  tubes  of  modified  K/10-Cu  media  with 
1*17  P-M  EDTA6  using  an  EPICS  753  flow  cytometer  (Coulter  Corp.).  Cells  were 
grown  at  21  °C  and  ^ixmolquantam-2*’1  blue  light  (14:10  lightdark) 
immediately  after  isolation  on  the  ship,  and  back  in  the  laboratory  at  24  °C 
and  varying  light  intensities  obtained  using  neutral  density  filters  and  cool- 
white  fluorescent  bulbs.  For  photosynthesis,  pigment  and  absorption  measure¬ 
ments,  cells  were  grown  at  a  low  irradiance  (9|imolQm’2s‘1),  where  differ¬ 
ences  between  SS120  (CCMP1375)  and  MED4  (CCMP1378)  were  apparent6. 
Cell  counts  for  growth  rate  calculations  and  normalization  of  pigment  data 
were  obtained  using  a  FACScan  flow  cytometer  (Becton-Dickinson),  and  mean 
chlorophyll  fluorescence  per  cell  and  FALS  are  presented  relative  to  0.474  pm 
yellow-green  fluorescent  beads  (Polysciences).  Flow  cytometric  data  were 
analysed  using  ‘CytoPC’  software  provided  by  D.  Vaulot  (Station  Biologjque, 
Roscoff,  France). 

Photosynthesis,  pigment  and  absorption  measurements.  Photosynth- 
esis-irradiance  measurements  were  carried  out  on  exponentially  growing  cells 
spiked  with  NaHl4C03  (0.1  pCi  mT1  culture;  specific  activity  between  100,000 
and  200,000  d.p.m.).  The  spiked  culture  was  dispensed  into  glass  scintillation 
vials  (1  ml  each)  and  incubated  at  24  °C  for  45  min  over  a  range  of  irradiances 
obtained  using  very  high  output/daylight  spectrum  fluorescence  bulbs 
attenuated  with  neutral  density  filters  and  measured  with  a  4t  quantum 
light  meter  (model  QSL-100,  Biospherical  Instruments,  San  Diego,  CA). 
Photosynthesis  was  terminated  and  unincorporated  ,4C  removed  by  adding 
2  M  HC1  ( 1 00  pi)  and  shaking  for  —2  h  under  a  fume  hood.  Samples  were  then 
analysed  by  scintillation  spectrometry.  The  photosynthetic  parameters, 
and  or,  were  obtained  by  fitting  the  results  to  the  equation  from  ref.  25,  using  the 
curve-fitting  program  in  SigmaPIot  (Jandel  Scientific),  and  normalized  to  chi 
a2  and  chi  fc*  which  were  measured  spectrophotometrically26.  In  vivo  chloro¬ 
phyll  a2-specific  absorption,  a^w(A)  (m2  (mg  chi  a2)_1),  was  obtained  using  an 
opal  diffuser  on  a  Beckman  DU-640  spectrophotometer,  and  the  weighted 
average  (a^j)  was  calculated  over  the  photosynthetically  available  radiation 
range  (400-700  nm).  Maximum  quantum  yield,  4*^,  is  calculated  as  the  ratio 
of  adJa  to  a^j. 

Molecular  phylogenetic  analysis.  Genomic  DNA  was  isolated  from  cultures 
according  to  standard  methods27.  16S  rDNA  was  amplified  using  the  general 
eubacterial  primers  8-27f  (AGAGTTTGATCCTGGCTCAG)  and  1504-1486r 
(CTTGTTACGACTTCACCCC).  Polymerase  chain  reactions  (PCRs)  were 
performed  in  quintuplicate,  using  the  high-fidelity  polymerase  Pfu  (Strata- 
gene).  Reactions  were  pooled  and  purified  using  Q1A  quick  kit  (QIAGEN, 
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Chatsworth,  CA)  and  cloned  using  the  pCR- Script  kit  (Stratagene).  A  mini¬ 
mum  of  8  clones  from  each  culture  were  sequenced  on  an  automated  sequencer 
(LI-COR,  Lincoln,  NE).  Sequences  were  aligned  manually  with  other  marine 
cyanobacterial  sequences  available  in  the  Ribosomal  Database  Project2*  using 
the  Genetic  Data  Environment29.  The  sequence  for  MIT9303  obtained  pre¬ 
viously  from  a  PCR  product17  is  contained  within  the  sequence  we  report  here. 

A  total  of  1,094  unambiguously  aligned  and  determined  nucleotides  were  used 
in  the  analyses.  Phylogenetic  analyses  used  PAUP*  (version  4.0d47>  provided  by 
D.  Swofford).  For  both  distance  and  maximum  likelihood  analyses  the  model  of 
nucleotide  substitution  used  was  the  Hasegawa  Kishino  Yana  1985  model. 

Nucleotide  frequencies  and  the  transition  transversion  ratio  were  estimated 
from  the  data. 
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Prochlorococcus  is  the  dominant  phototroph  in  the  temperate  oceans.  It  is  found 
throughout  the  euphotic  zone,  at  light  intensities  spanning  over  four  orders  of 
magnitude.  This  growth  range  is  larger  than  observed  for  any  single  culture  of 
Prochlorococcus  and  is  likely  due  to  the  co-existence  of  multiple  ecotypes  adapted 
for  optimal  growth  at  different  light  intensities.  Here  we  report  the  16S  rDNA 
sequences  of  four  Prochlorococcus  isolates  from  the  Pacific  Ocean,  MIT9201, 
MIT9202,  MIT9211  and  MIT9215,  which  have  been  physiologically  characterized  as 
high  or  low  B/A  ecotypes  based  on  their  chi  h/ci2  ratios  and  light-dependent  growth 
responses.  The  three  low  B/A  isolates  are  closely  related  and  cluster  together  with 
other  low  B/A  isolates  in  a  previously  identified  “high-light  adapted”  clade.  The 
fourth  Pacific  isolate,  a  high  B/A  ecotype,  is  not  a  member  of  this  clade.  Sequences 
specific  to  each  physiological  type  will  be  useful  in  probing  field  populations  to 
determine  the  spatial  and  temporal  distribution  of  the  ecotypes  of  Prochlorococcus. 

The  marine  cyanobacterium  Prochlorococcus  (Chisholm  et  al.  1988,  Chisholm  et  al. 

1992)  is  abundant  (often  105  cells  ml"1)  and  widespread  in  the  tropical  and  subtropical 
regions  of  the  open  ocean.  Prochlorococcus  is  found  throughout  the  euphotic  zone  and 
often  comprises  40-60%  of  the  total  chlorophyll  in  the  Pacific  (Chavez  et  al.  1991, 
Letelier  et  al.  1993,  Campbell  et  al.  1994,  Shimada  et  al.  1995),  the  north  Atlantic 
(Goericke  &  Welschmeyer  1993,  Veldhuis  et  al.  1993),  and  the  Red  Sea  and  Somalia 
Upwelling  region  (Veldhuis  &  Kraay  1993,  Lindell  &  Post  1995). 
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We  refer  to  Prochlorococcus  as  a  cyanobacterium,  despite  its  original  description  as  a 
“prochlorophyte”  (Chisholm  et  al.  1988)  —  referring  to  oxygenic  photosynthetic 
prokaryotes  that  possess  chi  b  and  lack  phycobilins  (Lewin  1981).  Molecular 
phylogenies  constructed  using  both  the  16S  ribosomal  RNA  gene  (16S  rDNA)  (Urbach  et 
al.  1992)  and  the  gene  for  the  large  subunit  of  RNA  polymerase  ( rpoCl )  (Palenik  & 
Haselkom  1992)  demonstrate  that  Prochlorococcus  is  most  closely  related  to  marine 
Synechococcus,  and  phylogenetically  distinct  from  other  prochlorophyte  lineages.  The 
close  relationship  between  Prochlorococcus  and  Synechococcus  is  also  apparent  in 
phylogenies  constructed  using  the  psbA  and  psbB  genes  and  portions  of  the  petB  and 
petD  genes  (Hess  et  al.  1995,  Urbach  et  al.  1998),  and  in  the  possession  of  phycoerythrin 
genes  by  some  strains  of  Prochlorococcus  (Hess  et  al.  1996,  Ting  et  al.  1998).  Further, 
Prochlorococcus  has  a  suite  of  pigments  distinctly  different  from  that  of  two  other 
previously  designated  prochlorophytes  Prochloron  and  Prochlorothrix  (Chisholm  et  al. 
1992).  It  is  now  clear  that  the  chi  b  phenotype  has  evolved  independently  at  least  four 
times  (La  Roche  et  al.  1996)  and  that  these  organisms,  along  with  traditional 
cyanobacteria,  should  probably  be  reclassified  as  Oxyphotobacteria  (Pinevich  et  al. 

1997).  Although  phylogenetic  analyses  using  the  gene  for  the  large  subunit  of  Rubisco 
(rbcL)  place  one  isolate  of  Prochlorococcus  with  the  gamma  proteobacteria  (Shimada  et 
al.  1995),  this  likely  reflects  a  horizontal  gene  transfer  event,  and  not  the  true  organismal 
phylogeny  (Delwiche  &  Palmer  1996).  Two  other  Prochlorococcus  isolates  possess  rbcL 
sequences  which  branch  within  the  cyanobacterial  clade  (Pichard  et  al.  1997)  and  one 
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strain  of  Synechococcus  has  a  proteobacterial-like  rbcL  sequence  as  well  (Watson  & 
Tabita  1996). 


Using  flow  cytometry  to  study  picoplankton  in  the  field,  we  and  others  have  observed 
distinct  co-occurring  populations  of  Prochlorococcus,  based  on  their  different  mean 
chlorophyll  fluorescence  intensities  (Campbell  &  Vaulot  1993,  Veldhuis  &  Kraay  1993, 
McManus  &  Dawson  1994,  Partensky  et  al.  1996).  Although  these  populations  could 
have  been  derived  from  the  mixing  together  of  genetically  identical  Prochlorococcus 
cells  acclimated  to  different  past  light  and/or  nutrient  regimes,  several  lines  of  evidence 
suggested  they  were  physiologically  and  genetically  distinct  populations.  The  latter 
interpretation  was  supported  by  physiological  experiments  defining  two  different  types  of 
Prochlorococcus  (Partensky  et  al.  1993,  Moore  et  al.  1995).  A  comparative  study  of  two 
isolates,  the  type  strain  SS120  (CCMP1375)  and  MED4  (CCMP1378),  demonstrated  that 
saturating  growth  irradiances  are  significantly  lower  for  SS120  than  for  MED4:  SS120  is 
photoinhibited  above  140  pE  m'2  s'1,  and  ratios  of  chlorophyll  b  to  divinyl  chlorophyll  a 
(chi  a2  (Goericke  &  Repeta  1993))  are  almost  an  order  of  magnitude  higher  in  SSI 20 
over  all  irradiances  measured  (Moore  et  al.  1995).  Consistent  with  these  data, 
biochemical  characterization  of  the  two  strains  indicates  that  the  pigment  complexes  of 
SS 1 20  are  bigger  and  more  abundant  at  low  light,  and  bind  seven  times  more  chi  b  than 
those  of  MED4  (Partensky  et  al.  1997). 
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The  coexistence  of  physiologically  and  genetically  distinct  types  of  Prochlorococcus  was 
definitively  shown  recently  when  we  flow-cytometrically  sorted  and  cultured  the  high 
and  low  fluorescence  cells  from  coexisting  Prochlorococcus  populations  in  two  locations 
of  the  North  Atlantic  Ocean  (Moore  et  al.  1998).  At  low  irradiances  the  high 
fluorescence  isolates,  MIT9303  and  MIT9313,  have  relatively  high  growth  rates  and  a 
high  chi  bla.2  ratio,  similar  to  SS120,  whereas  the  low  fluorescence  isolates,  MIT9302  and 
MIT9312,  have  a  lower  growth  rate  and  a  low  chi  bla.2  ratio,  similar  to  MED4  (Moore  et 
al.  1998). 


The  physiology  of  these  six  isolates  and  four  isolates  from  the  Pacific  Ocean  (MIT9201, 
MIT9202,  MIT921 1,  MIT9215)  suggests  the  genus  Prochlorococcus  can  be  divided  into 
at  least  two  ecotypes  based  on  pigment  content  and  light-dependent  physiological 
response  (Moore  &  Chisholm  1999).  Pigment  content  has  also  been  used  to  distinguish 
isolates  of  Synechococcus  (e.g.  high  and  low  PUB  types;  see  Waterbury  et  al.  1986).  The 
distinction  between  the  two  groups  of  Prochlorococcus  is  best  defined  using  their  chi 
bla2  ratios  (Fig.  1)  since  at  any  particular  irradiance  there  is  no  overlap  in  this  parameter. 
One  group  of  Prochlorococcus  isolates,  the  low  B/A  ecotype  (Moore  &  Chisholm  1999), 
cluster  with  chi  bla2  ratios  below  0.7  while  the  other  four  isolates,  the  high  B/A  ecotype, 
have  higher  chi  bla2  ratios,  but  exhibit  more  variability  between  them  (Fig.  1).  High  B/A 
isolates  are  also  characterized  by  their  higher  chlorophyll  (^-specific  light  harvesting 
efficiencies  and  spectrally-weighted  average  absorption  coefficients.  High  B/A  isolates 
also  show  decreases  in  growth  rate  due  to  high  light  exposure  at  light  levels  where  the 
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Figure  1.  Chlorophyll  bla2  ratio  of  different  Prochlorococcus  isolates  grown  over  a 
range  of  light  intensities  (for  isolation  information  see  Moore  &  Chisholm,  1999). 
The  isolates  can  be  grouped  loosely  into  two  clusters,  those  with  low  chi  bla2  ratios 
(low  B/A  ecotype)  and  those  with  high  bla2  ratios  (high  B/A  ecotype).  The  high  B/A 
ecotype  appears  to  have  two  sub-groups  within  it.  Data  from  Moore  &  Chisholm, 
1999. 
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low  B/A  isolates  grow  maximally.  Further,  a  secondary  division  can  be  made  within  the 
high  B/A  isolates:  when  grown  under  high  light  MIT9303  and  MIT9313  have  higher  cell- 
specific  light  harvesting  efficiencies  and  maximum  photosynthetic  rates  than  SS120  and 
MIT9211  (Moore  &  Chisholm  1999). 


Genetic  analysis  of  Prochlorococcus  isolates  indicates  that  there  is  a  correlation  between 
photophysiology  and  phylogeny.  Characterization  of  ten  isolates  by  RFLP  mapping 
(using  probes  to  photosynthetic  genes)  groups  them  into  two  clusters,  one  containing  the 
high  B/A  isolate  SS120  and  the  other  containing  the  low  B/A  isolate  MED4  (Scanlan  et 
al.  1996).  Analysis  of  16S  rDNA  sequences  from  these  isolates  (Urbach  et  al.  1998) 
reveals  a  well  supported  group,  deemed  the  “high-light  adapted  clade”  because  it  includes 
the  low  B/A  MED4  and  two  Pacific  isolates,  SB  and  GP2,  which  have  low  chi  b/a2  ratios 
(Shimada  et  al.  1996).  Support  for  this  clade  is  further  strengthened  by  analyses 
demonstrating  that  the  low  B/A  isolates  MIT9302  and  MIT9312  (each  from  a  coexisting 
pair  of  isolates)  are  also  members  of  this  clade  (Moore  et  al.  1998). 

The  16S  rDNA  sequences  of  four  Prochlorococcus  isolates  from  the  Pacific  Ocean 
further  support  the  correlation  of  photophysiology  with  phylogeny.  On  the  basis  of  their 
photophysiology  three  of  these  isolates  (MIT9201,  MIT9202  and  MIT9215)  belong  to  the 
low  B/A  ecotype,  while  the  fourth  (MIT921 1)  is  characterized  as  high  B/A  (Moore  & 
Chisholm  1999).  The  three  low  B/A  isolates  are  more  than  99%  similar  to  each  other  in 
their  16S  rDNA  sequence  and  cluster  within  the  “high-light  adapted  clade”  (Fig.  2).  One 
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Figure  2.  Phylogenetic  relationships  of  Prochlorococcus  and  Synechococcus  isolates 
and  environmental  sequences  (SAR)  from  the  Sargasso  Sea  inferred  from  16S  rRNA 
gene  sequences.  Sequences  of  Pacific  isolates  MIT9201,  MIT9202,  MIT9211  and 
MIT9215  (in  bold)  were  determined  using  previously  described  methods  .  All 
analyses  (tree  construction  and  similarity  values  cited  in  text)  employed  1070 
unambiguously  determined  and  aligned  bases  of  the  16S  rDNA.  The  phylogenetic 
framework  was  inferred  by  a  distance  analysis,  using  minimum  evolution  as  the 
objective  function.  Bootstrap  values  from  distance  and  parsimony  analyses  (above 
and  below  the  branches,  respectively)  are  presented  out  of  100  resampled  datasets; 
values  below  50  are  not  shown.  The  freshwater  cyanobacterium  Synechococcus 
PCC6301  was  used  to  root  the  tree. 
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isolate,  MIT9202,  has  a  sequence  identical  to  those  of  two  other  isolates,  TATL2,  from 
the  Tropical  Atlantic,  and  SB  from  the  West  Pacific,  corroborating  previous  results 
demonstrating  that  Prochlorococcus  from  different  oceanic  regimes  can  be  genetically 
similar  (Urbach  et  al.  1998).  Although  all  of  the  isolates  which  cluster  in  the  low  B/A 
clade  are  closely  related  (at  least  98.8%  similar  to  one  another),  there  is  a  well  supported 
subdivision  within  this  clade.  One  of  these  subgroups  contains  MED4,  which  has  the 
lowest  chi  b/ci2  ratio  of  all  the  isolates  we  have  characterized  (Fig.l),  although  no  further 
physiological  distinction  between  the  two  subgroups  is  apparent. 

Congruent  with  the  larger  degree  of  physiological  variability  among  isolates  of  the  high 
B/A  ecotype,  they  also  exhibit  greater  genetic  variability  than  the  low  B/A  isolates.  The 
16S  rDNA  sequence  similarities  of  MIT9211  with  SS120,  MIT9313  and  MIT9303  are 
98.6%,  98.2%  and  97.8%  respectively.  These  similarities  are  lower  than  those  between 
the  low  B/A  isolates,  and  are  only  slightly  greater  than  the  similarities  between  MIT921 1 
and  Synechococcus  WH8103  (97.4%).  The  high  B/A  isolates  MIT9303  and  MIT9313 
share  even  higher  sequence  similarities  to  Synechococcus  (Moore  et  al.  1998).  The  high 
B/A  isolates  do  not  cluster  together  in  a  single  clade  but  are  on  separate,  basal  branches 
of  the  tree  (Fig.  2),  consistent  with  the  possible  subdivisions  within  this  ecotype 
suggested  by  their  photophysiology. 

Future  directions  include  further  consideration  of  the  taxonomic  status  of  the  ecotypes  we 
have  described.  It  is  possible  that  the  high  and  low  B/A  ecotypes  represent  different 
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species  of  Prochlorococcus.  However,  a  definitive  determination  will  require  either 
sequence  data  from  a  more  variable  locus  to  provide  more  resolution  between  the  high 
B/A  isolates  and  marine  Synechococcus,  or  axenic  cultures  of  each  ecotype  in  order  to 
perform  other  taxonomic  analyses  such  as  DNA-DNA  hybridizations  and  determinations 
of  %  GC  content  and  fatty  acid  composition  (Wayne  et  al.  1987,  Priest  &  Austin  1993, 
Stackebrandt  &  Goebel  1994,  Tamaoka  1994).  At  present  we  adopt  the  convention  used 
for  marine  Synechococcus  (Waterbury  et  al.  1986)  and  designate  our  isolates  simply  as 
strains  of  the  genus  Prochlorococcus.  Although  we  have  definitively  described  two 
ecotypes,  we  consider  this  a  minimum  number.  As  described  above,  there  are 
undoubtedly  subdivisions  within  both  the  high  and  low  B/A  ecotypes.  Future 
investigations  will  likely  reveal  additional  ecotypes  not  currently  represented  in  culture 
and/or  further  ecotypic  differentiation  based  on  susceptibility  to  cyanophages,  nutrient 
utilization  efficiencies,  or  trace  metal  toxicity  (Mann  et  al.  1997). 

A  second  direction  we  are  actively  pursuing  is  determining  the  distributions  of  the  two 
ecotypes  in  the  field.  It  has  been  suggested  that  low-light  adapted,  high  chi  bla2 
Prochlorococcus  predominate  in  the  deeper  portion  of  the  euphotic  zone  and  that  high¬ 
light  adapted,  low  chi  b/a2  Prochlorococcus  predominate  in  surface  waters  (Campbell  & 
Vaulot  1993,  Goericke  &  Repeta  1993,  Moore  et  al.  1995).  Analyses  of  natural 
populations,  by  examining  sequences  from  environmental  clone  libraries  (Ferris  & 
Palenik  1998)  and  using  molecular  probes  designed  to  take  advantage  of  the  correlation 
of  physiology  with  phylogeny  (N.  West  &  D.  Scanlan,  pers.  comm.),  are  consistent  with 
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this  hypothesized  distribution.  Repeated  observations  of  multiple  phylotypes  in  single 
water  samples  (Palenik  1994,  Ferris  &  Palenik  1998,  Urbach  &  Chisholm  1998)  indicate 
that  further  work  is  needed  to  determine  how  the  ecotypes  are  distributed  under  different 
environmental  conditions.  It  is  clear  that  the  distribution  of  multiple  Prochlorococcus 
ecotypes  in  the  same  water  column  would  result  in  greater  integrated  production  than 
could  be  achieved  by  a  single  ecotype  (Moore  et  al.  1998).  The  existence  of  multiple 
ecotypes  allows  for  survival  over  a  broader  range  of  conditions  than  could  be  achieved  by 
a  physiologically  and  genetically  homogenous  population  and  undoubtedly  contributes  to 
the  dominance  of  Prochlorococcus  in  the  world’s  oceans. 
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CHAPTER  FOUR 


Resolution  of  Prochlorococcus  and  Synechococcus  ecotypes  based  on 
pigment  analysis  and  16S/23S  rDNA  Internal  Transcribed  Spacer  (ITS) 

region  sequences 
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ABSTRACT 


Cultured  isolates  of  the  marine  picoplankton  Prochlorococcus  and  Synechococcus  vary 
widely  in  their  pigment  compositions  and  growth  responses  to  light  and  nutrients.  In 
order  to  better  define  the  genetic  variation  that  accompanies  this  physiological  diversity, 
sequences  for  the  16S  23S  rDNA  internal  transcribed  spacer  region  (ITS)  were 
determined  in  30  Prochlorococcus  isolates  and  14  marine  A  Synechococcus  isolates  from 
around  the  globe.  All  strains  examined  yielded  a  single  ITS  sequence  which  contained 
two  tRNA  genes.  Dramatic  variations  in  the  length  and  %GC  content  of  the  spacer  were 
observed  among  the  strains,  particularly  among  Prochlorococcus.  The  44  strains  can  be 
grouped  into  ten  sequence  similarity  clusters,  each  of  which  likely  represents  an 
ecologically  distinct  population.  The  ITS  enabled  the  identification  of  four  clusters 
within  marine  A  Synechococcus  strains,  and  six  clusters  of  Prochlorococcus,  two  within 
the  low  B/A  ecotype  and  four  within  the  high  B/A  ecotype.  A  better  understanding  of 
physiological  and  phylogenetic  diversity  in  cultured  isolates  provides  an  important 
framework  for  studies  which  draw  ecological  conclusions  about  natural  populations 
based  solely  on  DNA  sequence  data. 

INTRODUCTION 

In  open  ocean  ecosystems,  carbon  fixation  is  dominated  by  the  marine  cyanobacteria 
Prochlorococcus  and  Synechococcus.  Together  they  have  been  shown  to  contribute 
between  32  -  80%  of  the  primary  production  in  the  oligotrophic  oceans  (Goericke  and 
Welschmeyer  1993,  Li  1995,  Liu,  et  al.  1997,  Veldhuis,  et  al.  1997).  Prochlorococcus  is 
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closely  related  to  marine  A  Synechococcus  based  on  analyses  using  gene  sequences  from 
16S  ribosomal  RNA  (16S  rDNA)  and  a  subunit  of  DNA-dependent  RNA  polymerase 
(, rpoCl )  (Palenik  1992,  Urbach,  et  al.  1992).  However,  Prochlorococcus  contains  divinyl 
chlorophyll  a  (8-desethyl,  8-vinyl  chlorophyll  a,  or  “chi  a2”)  and  divinyl  chlorophyll  b 
(chi  b2)  as  its  major  photosynthetic  pigments  rather  than  the  phycobilisomes  that  are 
typical  of  cyanobacteria  (Chisholm,  et  al.  1988,  Chisholm,  et  al.  1992). 

Cultured  isolates  of  Prochlorococcus  have  been  divided  into  two  genetically  and 
physiologically  distinct  groups,  referred  to  as  ecotypes  because  their  differing 
physiologies  have  implications  for  their  ecological  distributions  (Moore,  et  al.  1998, 
Moore  and  Chisholm  1999,  Rocap,  et  al.  1999).  High  B/A  isolates  have  larger  ratios  of 
chi  b/a2  and  are  able  to  grow  at  extremely  low  irradiances  (less  than  5  (imol  Q  m'2  s'1) 
where  low  B/A  isolates  are  incapable  of  growth.  Low  B/A  isolates  have  lower  chi  b/a2 
ratios  and  are  able  to  grow  maximally  at  higher  light  intensities,  where  high  B/A  isolates 
are  inhibited  (Moore  and  Chisholm  1999).  The  ecotypes  also  differ  in  their  sensitivity  to 
copper  toxicity,  with  low  B/A  isolates  able  to  grow  at  free  cupric  ion  concentrations  five 
times  higher  than  those  high  B/A  isolates  can  tolerate  (Mann  2000). 

Thel6S  rDNA  sequences  of  the  Prochlorococcus  ecotypes  correlate  with  their 
physiology.  Strains  of  the  low  B/A  ecotype  are  phylogenetically  very  closely  related 
(99%  similarity  in  16S  rDNA  sequence)  and  cluster  in  a  tight  clade  (Moore,  et  al.  1998, 
Urbach,  et  al.  1998,  Rocap,  et  al.  1999).  Strains  of  the  high  B/A  ecotype  are  less  similar 
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in  their  16S  rDNA  sequence  (97-98%)  and  do  not  form  a  monophyletic  group  (Rocap,  et 
al.  1999).  High  B/A  Prochlorococcus  also  have  a  higher  degree  of  sequence  similarity  to 
marine  Synechococcus  than  the  low  B/A  ecotype.  In  fact,  branching  orders  between 
some  high  B/A  Prochlorococcus  isolates  and  Synechococcus  are  not  well  resolved  using 
16S  rDNA  sequences  (Moore,  et  al.  1998,  Urbach,  et  al.  1998,  Rocap,  et  al.  1999). 

Open  ocean  Synechococcus  consist  of  a  genetically  and  physiologically  diverse  set  of 
strains.  All  contain  phycoerythrin  as  their  major  light-harvesting  pigment,  and  some 
possess  the  chromophore  phycourobilin  (PUB)  which  can  attach  to  phycoerythrin  in 
place  of  phycoerythrobilin  (PEB)  (Ong  and  Glazer  1991).  The  relative  amounts  of  PUB 
and  PEB  vary  among  strains.  In  addition,  some  isolates  are  capable  of  a  novel  form  of 
swimming  motility  (Waterbury,  et  al.  1985,  Waterbury,  et  al.  1986).  Synechococcus  also 
vary  in  the  mol  percent  G+C  content  of  their  genomes,  and  in  the  ability  to  utilize  organic 
nutrient  sources  such  as  urea  and  cyclic  AMP  (Waterbury,  et  al.  1986). 

Genetic  diversity  in  Synechococcus  has  been  examined  in  a  few  strains  using  16S  rDNA 
sequences  (Urbach,  et  al.  1998)  and  more  extensively  using  rpoCl  sequences  (Toledo 
and  Palenik  1997,  Toledo,  et  al.  1999).  Using  rpoCl  a  collection  of  strains  from  the 
California  Current  could  be  divided  into  two  lineages  consistent  with  their  high  or  low 
PUB  amounts.  However,  each  of  these  lineages  was  distinct  from  the  typical  laboratory 
model  high  and  low  PUB  strains  WH8103  and  WH7803,  suggesting  that  pigment  content 
alone  may  not  resolve  the  multiple  ecotypes  of  marine  Synechococcus.  Motility, 
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however,  does  appear  to  be  correlated  with  phylogeny,  as  all  motile  isolates  characterized 
to  date  are  closely  related  (Toledo,  et  al.  1999). 

Clusters  of  strains  that  are  closely  related  genetically,  as  observed  in  Prochlorococcus 
and  Synechococcus,  are  a  common  feature  in  the  microbial  world  (Ward,  et  al.  1998).  It 
has  been  suggested  that  such  sequence  similarity  clusters  should  be  at  the  heart  of  a 
natural  species  concept  for  bacteria  (Ward  1998),  and  that  ecologically  neutral  mutations 
in  rDNA  or  other  “housekeeping  genes”  are  excellent  markers  for  distinct  ecotypes 
(Palys,  et  al.  1997).  In  practice,  however,  as  in  the  case  of  high  B/A  Prochlorococcus 
and  Synechococcus,  16S  rDNA  can  be  too  conserved  to  adequately  resolve  closely 
related  yet  ecologically  distinct  strains. 

In  most  eubacteria  the  genes  for  ribosomal  RNA  are  organized  in  operons  with  the  genes 
encoding  the  5S,  16S  and  23S  rRNAs  separated  by  internal  transcribed  spacer  (ITS) 
regions.  The  spacer  regions  probably  have  a  role  in  holding  secondary  structure  of 
preRNA  for  processing  to  mature  rRNAs  (Apirion  and  Miczak  1993).  The  spacer 
between  the  16S  and  23S  rRNA  genes  can  encode  0,  1,  or  2  tRNA  genes  and  exhibit  a 
great  deal  of  length  and  sequence  variation.  The  ITS  has  been  used  in  many  bacterial 
groups  to  delineate  closely  related  strains  (Barry,  et  al.  1991,  Giirtler  and  Stanisich  1996, 
Leblond-Bourget,  et  al.  1996,  Christensen,  et  al.  1999).  Because  there  is  only  one  rRNA 
operon  in  Prochlorococcus,  the  ITS,  like  the  16S  rDNA,  is  present  in  a  single  copy 
(Urbach,  et  al.  1998,  West  and  Scanlan  1999). 
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Here  we  report  on  use  of  the  ITS  as  phylogenetic  tool  to  identify  strain  clusters  which 
may  represent  ecologically  distinct  populations  of  marine  cyanobacteria.  Although 
isolates  in  culture  collections  may  not  represent  the  full  extent  of  diversity  due  to  biases 
introduced  by  isolation  protocols,  they  provide  crucial  physiological  information  to  attach 
to  the  strain  clusters.  By  examining  a  wide  range  of  physiologically  diverse  isolates  of 
Prochlorococcus  and  Synechococcus  we  hope  to  lay  the  groundwork  for  informed  studies 
of  genetic  diversity  and  distributions  in  field  populations  of  marine  cyanobacteria. 

MATERIALS  AND  METHODS 

Isolation  of  strains.  Thirty  isolates  of  Prochlorococcus  from  diverse  oceanic  regimes 
were  employed  in  this  study  (Table  1).  Isolation  conditions,  physiology  and  genetic  data 
have  been  reported  previously  for  many  of  the  strains  (see  Table  1).  The  majority  were 
isolated  by  filtering  seawater  through  two  stacked  0.6  pm-pore-size  filters  and  enriching 
with  nutrients  (Chisholm,  et  al.  1992).  Five  were  isolated  by  sorting  on  a  flow  cytometer 
(Moore,  et  al.  1998).  Seven  of  the  strains  (SSI 20,  SS35,  SS51,  SS2,  MED4,  SB,  and 
GP2)  have  been  rendered  clonal  by  serial  dilution  and  one  (MED4Ax)  has  been  rendered 
free  of  heterotrophic  bacteria  by  plating  (M.  Saito  and  J.  Waterbury,  unpublished  data). 

The  30  strains  of  Prochlorococcus  include  the  10  isolates  characterized  by  Moore  and 
Chisholm  (1999),  the  15  isolates  described  in  this  work  and  5  additional  isolates.  These  5 
additional  isolates  are  clonal  derivatives  of  previously  characterized  strains.  The  original 
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Table  1.  Prochlorococcus  isolates  used  in  this  study. 


OTHER 

DATE 

ISOLATION 

ORIGINAL 

ISOLATES 

NAMES 

COORDINATES 

DEPTH 

ISOLATED 

METHOD 

REFERENCE 

Sargasso  Sea 
SARG 

LG,  SSW5 

29°N;  64.4°W 

120  m 

May  ’88 

filtered  enrichment 

(Chisholm,  et  al.  1992) 

SSI  20 

CCMP1375 

ii 

M 

clonal  by  dilution 
from  SARG 

(Chisholm,  et  al.  1992) 

SS35 

CCMP1428 

" 

" 

li 

<( 

“ 

SS51 

CCMP1376 

II 

n 

it 

“ 

M 

SS2 

CCMP1427 

" 

n 

" 

M 

“ 

MIT9301 

34.2°N;  66.3°W 

90  m 

Jul ’93 

filtered  enrichment 

this  work 

MIT9302 

34.8°N;  66.2°W 

100  m 

Jul  ’93 

sorted  on  FCM 

(Moore,  et  al.  1998) 

MIT9303 

MIT9401 

35.5°N;  70.4°W 

Surface 

May  ’94 

sorted  on  FCM 
filtered  enrichment 

(Moore,  et  al.  1998) 
this  work 

Gulf  Stream 

MIT9311 

37.5°N;  68.2°W 

135  m 

Jul  ’93 

sorted  on  FCM 

this  work 

MIT9312 

MIT9313 

MIT9314 

37.5°N;  68.2°W 

li 

180  m 

li 

sorted  on  FCM 
sorted  on  FCM 
filtered  enrichment 

(Moore,  et  al.  1998) 
(Moore,  et  al.  1998) 
this  work 

North  Atlantic 

NATL  1 A 
NATL2A 

37.4°N;  40°W 
39°N;  49.3°W 

30  m 

10  m 

Apr.  ‘90 

filtered  enrichment 
filtered  enrichment 

(Partensky,  et  al.  1993) 
(Scanlan,  et  al.  1996) 

_ _ _ _ (Urbach,  et  at.  1998) 

Mediterranean  Sea 


MED  DV1 

MED4  CCMP1378 

MED4Ax 

43.2°N;  6.9°E 

<< 

5  m 

(( 

Jan  ’89 

filtered  enrichment 
clonal  by  dilution 
from  MED 
axenic,  cloned  from 
MED4  by  plating 

(Moore,  et  al.  1995) 

Saito  et  al  in  prep 

South  Pacific 

MIT9107 

15°S;  135°W 

25  m 

Aug  ’91 

filtered  enrichment 

(Urbach,  et  al.  1998) 

MIT91 16 

" 

ll 

" 

filtered  enrichment 

this  work 

MIT9123 

" 

M 

" 

filtered  enrichment 

this  work 

MIT9201 

12°S;  145.4°W 

Surface 

Sep  ’92 

filtered  enrichment 

(Moore  and  Chisholm  1999) 

MIT9202 

" 

79  m 

" 

filtered  enrichment 

(Moore  and  Chisholm  1999) 

Eauatorial  Pacific 

MIT9211 

0°;  140°W 

~83  m 

Apr  ’92 

filtered  enrichment 

(Moore  and  Chisholm  1999) 

MIT9215 

0°;  140°W 

Surface 

Oct  ’92 

filtered  enrichment 

(Moore  and  Chisholm  1999) 

MIT9321 

1°N;  92°W 

50  m 

Nov  ’93 

filtered  enrichment 

this  work 

MIT9322 

0.3°N;  93 °W 

Surface 

Nov  ’93 

filtered  enrichment 

this  work 

MIT9515 

5.7°S;  107. 1°W 

15  m 

Jun  ’95 

filtered  enrichment 

this  work 

Western  Pacific 

GP2 

8°N;  136°E 

150m 

Sep.  ‘92 

filtered  enrichment. 

(Shimada,  et  al.  1995a) 

clonal  by  dilution 

SB 

35°N;  138. 3°E 

40m 

Oct.  ‘92 

filtered  enrichment. 

(Shimada,  et  al.  1995b) 

clonal  by  dilution 

Arabian  Sea 

AS9601 

19°S;  67°E 

50  m 

Nov  ‘95 

filtered  enrichment 

(Shalapyonok,  et  al.  1998) 
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isolate  of  Prochlorococcus,  S  ARG,  which  is  the  parent  strain  of  the  high  B/A  isolate 
SSI  20  (the  type  strain  of  Prochlorococcus  marinus),  has  also  given  rise  to  an  isolate  with 
genetic  and  physiological  properties  which  place  it  in  the  low  B/A  ecotype  (Rippka,  et  al. 
2000).  To  investigate  the  potential  heterogeneity  of  the  SARG  culture,  SARG  and  3  of 
its  other  clonal  derivatives  were  selected  for  sequencing.  Because  it  will  be  the  subject  of 
many  future  investigations,  Prochlorococcus  strain  MED4Ax,  rendered  axenic  by 
picking  a  single  colony  from  an  agar  plate  (Saito  and  Waterbury,  unpublished  data),  was 
also  sequenced  in  order  to  compare  it  with  its  parent  strain  MED4. 

The  isolation  and  culture  history  of  isolates  NATL1A  and  NATL2A  is  convoluted 
enough  to  warrant  a  special  mention.  Isolates  NATL1  and  NATL2  were  isolated  from 
the  North  Atlantic  in  1990  and  kindly  provided  to  the  MIT  Prochlorococcus  culture 
collection  by  Fred  Partensky  shortly  thereafter.  NATL1  was  originally  reported  to  have  a 
chi  b/a2  ratio  and  light  dependent  growth  responses  which  placed  it  in  the  high  B/A 
ecotype  (Morel,  et  al.  1993,  Partensky,  et  al.  1993).  However,  the  isolates  maintained  in 
the  Roscoff  Culture  Collection  of  Marine  Phytoplankton  (France)  changed  over  time  to 
have  a  phenotype  similar  to  the  low  B/A  ecotype  (described  in  Rippka,  et  al.  2000).  The 
altered  phenotypic  properties  are  consistent  with  RFLP  and  16S  rDNA  analyses 
demonstrating  that  both  Roscoff  isolates  are  closely  related  to  the  low  B/A  isolate  MED4 
(Scanlan,  et  al.  1996,  Urbach,  et  al.  1998,  Rippka,  et  al.  2000).  In  contrast  the  isolates 
maintained  at  MIT  continued  to  maintain  a  phenotype  characteristic  of  the  high  B/A 
ecotype  (L.  Moore  unpublished  observations).  The  MIT  NATL2  isolate  yielded  a  16S 
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rDNA  sequence  which  branched  outside  the  low  B/A  clade  and  was  2.5%  different  than 
the  sequence  from  the  Roscoff  NATL2  (Urbach,  et  al.  1998).  The  MIT  culture  was 
designated  NATL2A  (Urbach,  et  al.  1998)  and  in  parallel,  we  refer  to  the  NATL1  culture 
maintained  at  MIT  as  NATL  1 A  to  distinguish  it  from  the  isolate  maintained  at  Roscoff. 

Fifteen  of  sixteen  Synechococcus  isolates  used  in  this  study  (Table  2)  are  clonal  and  have 
been  described  previously  (Waterbury,  et  al.  1986).  Strain  WH9908  was  isolated  from 
Woods  Hole  in  April  1999  (by  M.  Sullivan),  when  the  water  temperature  was  less  than 
10°  C  and  rendered  clonal  by  picking  a  single  colony  from  an  agar  plate.  Of  the  16 
isolates,  14  are  phycoerythrin-containing  marine  A  cluster  strains.  One  freshwater  strain 
assigned  to  the  Cyanobium  cluster  (PCC6307)  and  one  strain  from  the  marine  B  cluster 
(WH8101)  are  included  as  outgroups  (Table  2). 

Culture  conditions.  For  physiology  experiments,  20  ml  batch  cultures  of 
Prochlorococcus  were  grown  in  acid  washed  50  ml  test  tubes  at  24°C  on  14:10  light:dark 
cycle  under  18  pmol  Q  m 2  s  1  white  light.  This  light  level  is  roughly  equivalent  to  1%  of 
surface  irradiance,  which  corresponds  to  a  depth  of  ~100m  (assuming  typical  oligotrophic 
water  values  for  surface  irradiance  of  L,  =  2000  pmol  Q  m'2  s'1  and  an  extinction 
coefficient  of  k  =  0.045  m'1  (Kirk  1994)).  This  growth  irradiance  was  chosen  based  on 
physiological  data  from  ten  other  Prochlorococcus  isolates  (Moore  and  Chisholm  1999), 
because  it  was  a  level  at  which  all  were  capable  of  growth  and  one  where  ecotypic 
differences  in  the  chi  b/a,2  ratio  were  pronounced. 
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Table  2.  Synechococcus  isolates  used  in  this  study 


STRAIN 

ISOLATION 

LOCATION 

DATE 

PE 

A495/ 

A545 

GENOME 

%GC 

MOTILE 

Cvanobium  cluster 

PCC6307  Wisconsin  Lake 

1949 

NA 

69.7 

Marine  B  cluster 

WH8 101  W oods  Hole 

1981 

NA 

63.9 

Marine  A  cluster 

WH6501  8.7°N,  50.8°W 

1965 

+ 

0.43 

62.3 

WH7803 

33.7°N,  67.5°W 

July  ‘78 

+ 

0.39 

61.3 

_ 

WH7805 

33.7°N,  67.5°W 

June  ‘78 

+ 

no  PUB 

59.7 

_ 

WH8002 

19.7°N,  92.4°W 

Apr.  ‘80 

+ 

0.48 

60 

_ 

WH8008 

19.7°N,  92.4°W 

Apr.  ‘80 

+ 

no  PUB 

55.8 

_ 

WH8012 

34°N,  65°W 

Jul.  ‘80 

+ 

0.4 

62.4 

_ 

WH8016 

Woods  Hole 

June  ‘80 

+ 

0.4 

55.5 

. 

WH8017 

Woods  Hole 

June  ‘80 

+ 

0.42 

54.5 

WH8018 

Woods  Hole 

June  ‘80 

+ 

no  PUB 

57.6 

_ 

WH8103 

28.5°N,  67.4°W 

Mar.  ‘81 

+ 

2.4 

58.9 

+ 

WH8109 

39.5°N,  70.5°W 

Jun. ‘81 

+ 

0.89 

? 

_ 

WH8112 

36°N,  66°W 

Oct.  ‘81 

+ 

variable 

59.8 

+ 

WH8113 

36°N,  66°W 

Oct.  ‘81 

+ 

variable 

60.5 

+ 

WH9908 

Woods  Hole 

Apr.  ‘99 

+ 

? 

? 

_ 

Data  in  table  adapted  from  Waterbury  et  al.  (1986).  PE  indicates  the  presence  or  absence  of 
phycoeryrthrin.  A495/A545  ratio  is  a  measure  of  relative  amounts  of  pigments  phycourobilin  (PUB)  to 
phycoerythrobilin  (PEB). 


Media  was  made  from  0.2  Jim  filtered,  autoclaved  Sargasso  Sea  water  enriched  with 
“Pro2”  nutrients  (final  concentrations  lOpM  NaH2  P04,  50pM  NH4C1,  IOOjiM  urea,  1.17 

JIM  EDTA,  8nM  Zn,  5  nM  Co,  90  nM  Mn,  3  nM  Mo,  10  nM  Se,  10  nM  Ni,  and  1.17  ^M 
Fe)  (Moore  and  Chisholm  1999). 


For  DNA  extraction  Prochlorococcus  were  grown  in  60  ml  acid  washed  polycarbonate 
bottles  using  the  media  and  culture  condition  described  above.  Synechococcus  were 
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grown  for  DNA  extraction  in  100ml  acid  washed  flasks  in  SN  media  under  constant  light 
(Waterbury,  et  al.  1986). 

Growth  rate,  pigment  and  flow  cytometric  measurements.  The  physiology  of  fifteen 
isolates  of  Prochlorococcus  was  examined  by  measuring  their  growth  rate,  pigment 
content  and  flow  cytometric  properties.  All  physiology  experiments  were  performed  in 
triplicate  (although  the  chi  b/a2  ratio  and  total  chlorophyll  values  presented  for  MIT9321 
represent  the  mean  of  duplicate  cultures).  Cells  were  acclimated  to  the  experimental 
conditions  for  at  least  ten  generations  before  measurements  were  taken.  Growth  rate  was 
determined  by  measuring  the  fluorescence  of  each  culture  at  the  same  time  of  day  in  a 
fluorometer  (Turner  model  10-AU). 

For  pigment  measurements  a  known  volume  (18-22  ml)  of  exponential  phase  culture  was 
filtered  onto  a  25  mm  Whatman  GF/F  under  low  vacuum.  Filters  were  stored  in  liquid 
nitrogen  until  extraction.  Pigments  were  extracted  according  to  established  protocols 
(Moore,  1997,  adapted  from  Goericke  and  Repeta,  1993),  and  quantified  on  a 
spectrophotometer  (Becton  Dickinson  DU640).  Unlike  HPLC,  spectrophotometric 
methods  cannot  resolve  divinyl  chlorophyll  b2  from  “normal”  monovinyl  chlorophyll  b, 
thus  total  ( bi  +  bi)  values  are  reported.  Pigment  concentrations  were  calculated 
according  to  the  trichromatic  equations  of  Jeffrey  and  Humphreys  (1975). 


59 


For  flow  cytometry,  1  ml  of  exponential  phase  culture  was  preserved  in  glutaraldehyde 
(final  concentration  0.125%)  and  stored  in  liquid  nitrogen  until  cell  numbers  and  relative 
red  fluorescence  were  determined  by  running  samples  on  modified  FACScan  flow 
cytometer  (Dusenberry  and  Frankel  1994).  Fluorescence  per  cell  is  reported  relative  to 
0.474  pm  diameter  yellow-green,  polystyrene  microspheres  (Polysciences,  Inc. 
Warrington,  PA).  Data  were  analyzed  using  Cyclops  software  (Cicero). 

DNA  isolation,  PCR  and  sequencing.  DNA  was  extracted  from  50  ml  of  late 
exponential  phase  cultures  using  a  modified  CTAB/phenol  chloroform  protocol  (Ausubel, 
et  al.  1992).  The  ITS/23S  fragment  was  amplified  using  primers  16S-1247f 
(CGTACT  AC  A  ATGCT  ACGG)  and  23S-1608r  (CYACCTGTGTCGGTTT).  Primer 
16S-1247f  was  designed  using  availiablel6S  rDNA  sequences  from  Prochlorococcus  and 
Synechococcus  (Rocap,  et  al.  1999)  and  is  a  perfect  match  only  to  cyanobacterial  16S 
rDNA  sequences  as  judged  by  using  the  CHECK  PROBE  function  of  the  latest  release  of 
the  Ribosomal  Database  Project  (Maidak,  et  al.  1999).  Reactions  were  done  in  25  pi 
volume  with  final  concentrations  of  reactants  as  follows:  0.25  mM  dNTPs,  0.1  mM  each 
primer,  0.1-1  pg  template  DNA  and  0. 1-0.5  U  of  the  high  fidelity  polymerase  Pfu. 

Cycling  parameters  were  94°  for  4  min  followed  by  30  cycles  of  (94°  1  min,  52°  1  min, 

72  6  min)  and  a  final  extension  at  72°  for  10  mm  using  either  a  robocycler  (Stratagene, 
La  Jolla  CA)  or  a  PTC  100  (MJ  Research).  Amplified  fragments  were  visualized  on 
agarose  gels.  Only  one  band  was  observed  from  each  culture.  Control  reactions  lacking 
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template  DNA  were  always  performed  in  parallel  and  gave  no  products,  as  judged  by 
inspection  on  agarose  gels. 

For  sequencing,  PCR  reactions  were  performed  in  quintuplicate,  pooled,  and  primers 
removed  using  Strataprep  columns  (Stratagene).  Products  were  sequenced  on  ABI377  or 
ABI310  (PE  BlOsystems)  automated  sequencers  using  Big  Dye  terminator  sequencing 
kits  according  to  the  manufacturer’s  instructions.  The  ITS  was  sequenced  bidirectionally 
using  primer  16S-1247F,  and  primers  internal  to  the  PCR  fragment:  ITS-alaf 
(TWT  AGCT  C  AGTT  GGT  AG  AG) ,  ITS-alar  (CTCTACCAACTGAGCTAWA)  and  23S- 
241r  (TTCGCT CGCCRCT ACT) . 

Phylogenetic  analysis.  Sequences  were  edited  and  aligned  manually  using  the  Genetic 
Data  Environment  (Smith,  et  al.  1994).  Phylogenetic  analyses  and  calculation  of 
fractional  similarities  and  %  GC  contents  used  PAUP*  version  4b2a  (S wofford  1999). 
Phylogenetic  analyses  employed  either  103  or  146  positions  of  the  16S-tRNAUe  spacer. 

Distance  trees  were  inferred  using  minimum  evolution  as  the  objective  criterion  and 
paralinear  (logdet)  or  HKY85  distances.  Distance  and  maximum-parsimony  bootstrap 
analyses  (1000  resamplings)  were  performed  with  heuristic  searches  utilizing  random 
addition  and  tree-bisection  reconnection  (TBR)  branch-swapping  methods.  Maximum 
likelihood  analyses  used  the  HKY85  model  of  nucleotide  substitution  with  rate 
heterogeneity  and  empirical  nucleotide  frequencies.  The  gamma  shape  parameter  and  the 
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transition/transversion  ratio  were  initially  estimated  from  a  distance  topology  and  refined 
by  iterative  likelihood  searches.  Likelihood  bootstrap  analyses  (100  resamplings)  were 
performed  with  heuristic  searches  and  tree-bisection  reconnection  (TBR)  branch¬ 
swapping  methods  starting  from  neighbor  joining  tree.  Phylogenetic  trees  were 
visualized  with  TREEVDEW  (Page  1996). 

RESULTS 

Physiology  of  Prochlorococcus  strains.  Fifteen  isolates  of  Prochlorococcus  were 
grown  at  18  |imol  Q  m'2  s'1  (roughly  equivalent  to  the  1  %  light  level)  and  their  chi  b/a2 
ratio  was  measured.  Growth  rates  of  the  isolates  ranged  between  0.3  -  0.5  day'1  (Fig. 

1A).  Chi  b/a2  ratios  for  thirteen  of  the  isolates  ranged  between  0.2  and  0.5  while  two, 
NATL  1 A  and  NATL2A,  had  much  larger  ratios  (0.86  and  0.97  respectively)  (Fig.  2B). 

In  a  previous  study  six  low  B/A  isolates  had  chi  b/a2  ratios  of  0.2  -  0.5  at  comparable 
irradiance  levels,  while  the  four  isolates  of  the  high  B/A  ecotype  had  chi  b/ci2  ratios  which 
ranged  from  0.9  -  1.5  (Moore  and  Chisholm  1999).  Thus,  these  results  suggest  that 
NATL1A  and  NATL2A  belong  to  the  high  B/A  ecotype,  while  the  other  thirteen  isolates 
characterized  here  are  of  the  low  B/A  ecotype. 

Total  chlorophyll  per  cell  observed  in  these  isolates  was  well  within  the  range  seen 
previously  (1-9  fg  cell'1)  (Fig.  1C),  as  was  the  flow  cytometrically  derived  chlorophyll 
(red)  fluorescence  (Fig.  ID)  (Moore  and  Chisholm  1999).  Differences  in  flow 
cytometrically  derived  fluorescence  have  been  observed  in  natural  populations  (Campbell 
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Figure  1.  Growth  rate,  pigment  and  fluorescence  properties  of  fifteen  isolates  of 
Prochlorococcus  grown  at  18  pmol  Q  m'2  s'1.  Values  are  means  (±  1SE)  of  triplicate 
cultures.  (A)  Growth  rate.  (B)  Chlorophyll  b/a2  ratio.  (C)  Total  cellular  chlorophyll 
(a2  +  b).  (D)  Mean  chlorophyll  (red)  fluorescence  per  cell,  relative  to  standard  beads, 
derived  from  flow  cytometry. 
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and  Vaulot  1993,  Veldhuis  and  Kraay  1993)  and  can  reflect  the  coexistence  of  multiple 
ecotypes  (Moore,  et  al.  1998).  Here,  the  flow  cytometrically  derived  chlorophyll 
fluorescence  was  strongly  correlated  with  the  total  chlorophyll  per  cell  (r2=0.83)  and  the 
chi  b/ci2  ratio  (r2=0.76)  and  weakly  correlated  with  growth  rates  of  the  isolates  (Fig.  2). 

Sequences  of  ITS.  Sequences  of  the  16S  23S  rDNA  internal  transcribed  spacer  region 
were  determined  for  30  strains  of  Prochlorococcus  and  16  clonal  strains  of 
Synechococcus.  Although  the  majority  of  the  Prochlorococcus  isolates  have  not  been 
rendered  clonal,  PCR  amplifications  yielded  single  band  products  and  few  sequence 
ambiguities  were  observed.  All  of  the  strains  examined  contained  genes  encoding  two 
tRNAs,  for  Isoleucine  and  Alanine,  as  has  been  observed  in  freshwater  Synechococcus 
PCC6803  (Tomioka  and  Sugiura  1984). 

Sequencing  of  the  46  strains  resulted  in  34  unique  ITS  sequences.  Prochlorococcus 
strains  SS120,  SS2,  SS35,  SS51  are  clonal  derivatives  of  the  primary  culture  SARG,  and 
all  5  strains  had  identical  sequences.  Prochlorococcus  strain  MED4Ax  also  had  an 
identical  sequence  to  its  parent  strain  MED4.  Prochlorococcus  strains  MIT9107, 

MIT91 16,  MIT9123,  co-isolates  from  the  same  water  sample  in  the  South  Pacific,  were 
identical  to  each  other,  as  were  co-isolates  MIT9312  and  MIT931 1  from  the  Gulf  Stream. 
The  Prochlorococcus  strains  MIT9321,  MIT9322  and  MIT9401  also  possessed  identical 
ITS  sequences,  although  MIT9401was  isolated  from  the  Sargasso  Sea  while  the  other  two 
are  from  the  Equatorial  Pacific.  Synechococcus  WH7805,  WH8008,  and  WH8018, 
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Relative  chlorophyll  (red)  fluorescence  per  cell 


Figure  2.  Correlation  of  mean  chlorophyll  (red)  fluorescence  per  cell  with 
other  physiological  properties  for  15  isolates  of  Prochlorococcus  (A)  Total 
chlorophyll  (B)  chi  b/a2  ratio  (C)  growth  rate 
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isolated  from  the  Sargasso  Sea,  the  Gulf  of  Mexico  and  Woods  Hole  respectively,  also 
possess  identical  ITS  sequences.  For  the  remainder  of  this  paper  results  will  be  presented 
only  for  the  34  unique  sequences,  with  SS120,  MED4,  MIT9107,  MIT9312,  MIT9401, 
and  WH7805  used  to  represent  the  six  sets  of  identical  strains. 

Marked  differences  in  the  length  of  the  ITS  were  observed  among  the  34  unique 
sequences  (Fig.  3).  Lengths  of  the  ITS  ranged  from  539  bp  in  Prochlorococcus  MIT9314 
to  1017  bp  in  Synechococcus  PCC6307.  The  length  differences  were  strongly  correlated 
with  ecotype.  All  of  the  low  B/A  Prochlorococcus  have  ITS  regions  ranging  in  length 
from  541-550  bp.  The  high  B/A  Prochlorococcus  have  much  longer  ITS  sequences,  and 
there  is  a  larger  range  of  lengths  among  them.  NATL1A,  NATL2A,  SSI  20  and 
MIT9211  ITS  sequences  range  in  length  from  634  to  695  bp,  while  MIT9303  and 
MIT9313  have  spacers  of  833  and  831  bp  respectively.  Marine  Synechococcus  strains 
range  from  749  bp  (WH8017)  to  812  bp  (WH8103).  The  majority  of  the  length 
difference  is  in  the  3  end  of  the  spacer  (tRNAAIa  -  23S  spacer)  which  ranges  from  254- 
528  bp  across  the  strains  examined  (Fig.  3). 

Substantial  differences  also  exist  in  the  %G+C  content  of  the  ITS  (Fig.  4).  Low  B/A 
Prochlorococcus  ITS  sequences  had  the  lowest  %G+C  content  (37-39%).  Again,  the 
high  B/A  Prochlorococcus  spanned  a  range  of  values.  The  %G+C  content  of  sequences 
from  NATL  1  A,  NATL2A,  SS120  and  MIT921 1  was  quite  similar  to  that  of  the  low  B/A 
isolates.  However  MIT9303  and  MIT9313  had  higher  %G+C  contents  of  44  and  45%. 
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Figure  3.  Length  of  the  internal  transcribed  spacer  region  (ITS)  between  the  16S  and 
23S  rDNAs  in  Prochlorococcus  and  Synechococcus  isolates.  Low  B/A 
Prochlorococcus  isolates  all  have  short  spacers,  while  high  B/A  isolates  have  longer 
spacer  lengths  and  are  not  as  similar  to  one  another.  The  freshwater  strain  PCC6307 
has  a  much  longer  spacer  than  the  marine  Synechococcus  and  Prochlorococcus  isolates. 
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Figure  4.  %  G+C  content  of  the  16S  23  S  rDNA  internal  transcribed  spacer  (ITS) 
region  and  its  two  most  variable  regions,  the  16S-  tRNAlle  spacer  and  the  tRNAA,a-23S 
spacer.  Low  B / A  Prochlorococcus  and  4  high  B/A  Prochlorococcus  isolates  have  a 
lower  %GC  content  than  2  high  B/A  Prochlorococcus  and  Synechococcus  isolates. 
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Values  for  MIT9303  and  MIT93 13  are  in  the  range  of  the  majority  of  the  marine 
Synechococcus  strains  (44-46  %G+C).  Two  Synechococcus  strains,  WH8016  and 
WH9908,  did  have  lower  %G+C  contents  (41  and  43%).  As  with  its  much  longer  length, 
the  %G+C  content  of  the  ITS  sequence  from  Cyanobium  PCC6307  was  markedly 
different  (54%G+C)  from  all  of  the  Prochlorococcus  and  marine  Synechococcus.  The 
majority  of  the  strains  had  lower  %G+C  contents  in  the  two  spacer  regions  than  in  the 
ITS  as  a  whole,  presumably  because  of  the  constraints  necessary  to  maintain  secondary 
structure  of  the  tRNAs.  The  tRNAAla  -  23S  spacer  region  sequences  were  the  lowest  in 
%G+C  content  and  also  exhibited  the  widest  range  among  the  isolates  (Fig.  4). 

Phylogenetic  relationships.  Phylogenetic  analyses  used  the  16S-tRNADe  spacer  because 
the  more  extreme  differences  in  length  and  %G+C  content  (Fig.  3  and  4)  made 
alignments  difficult  for  the  tRNAAla  -  23S  spacer.  Because  the  shorter  length  of  the  16S- 
tRNAne  spacer  in  the  low  B/A  Prochlorococcus  limited  the  positions  available  in  the 
unambiguously  aligned  dataset,  parallel  analyses  were  conducted  with  and  without  the 
low  B/A  Prochlorococcus.  Phylogenetic  analyses  used  103  nucleotide  positions  when  all 
strains  were  considered  and  an  expanded  dataset  of  146  nucleotide  positions  when  only 
the  high  B/A  Prochlorococcus  and  Synechococcus  were  considered  (Fig.  5).  Cyanobium 
cluster  Synechococcus  branches  outside  of  the  marine  picoplankton  clade  in  16S  rDNA 
analyses  (Urbach,  et  al.  1998)  and  was  included  here  as  an  outgroup.  However,  the  large 
degree  of  sequence  dissimilarity  between  PCC6307  and  the  marine  sequences  precluded 
confident  alignments,  so  it  was  excluded  from  phylogenetic  analyses. 
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Figure  5.  (Continued): 
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In  phylogenetic  analyses  confined  to  103  base  pairs  of  the  16S-tRNADe  spacer,  the  low 
B/A  Prochlorococcus  form  a  well  supported  clade,  which  can  be  further  subdivided  into 
two  clusters,  one  containing  MED4  and  MIT9515,  and  the  other  containing  the  remaining 
low  B/A  isolates  (Fig.  6).  This  division  is  reflected  in  the  amount  of  sequence  difference 
between  the  low  B/A  isolates.  The  low  B/A  cluster  II  isolates  are  93-99%  similar  in  their 
16S-tRNADe  spacer  sequences,  while  MED4  and  MIT9515  are  only  89-93%  similar  to 
the  low  B/A  cluster  II  isolates  (Table  3). 

The  high  B/A  Prochlorococcus  do  not  form  a  monophyletic  clade,  but  are  dispersed  in 
four  distinct  lineages  (Fig.  6).  One  cluster,  designated  high  B/A  I,  contains  isolates 
NATL  1 A  and  NATL2A  which  consistently  branch  together  and  share  a  high  spacer 
sequence  similarity  (97%).  Isolates  SS120  and  MIT921 1  are  much  less  similar  to  each 
other  (82%).  Although  distance  methods  provide  some  support  for  SS120  and  MIT921 1 
branching  together,  parsimony  and  likelihood  methods  do  not  (Fig.  6).  Since  neither 
isolate  has  a  high  sequence  similarity  to  each  other  or  to  the  other  four  high  B/A  isolates 
(Table  3),  they  have  each  been  assigned  to  a  different  cluster  (Fig.  6).  Finally,  isolates 
MIT9303  and  MIT93 13  make  up  Prochlorococcus  high  B/A  cluster  IV.  Although 
support  for  this  grouping  varies  somewhat  depending  on  the  method  of  tree  inference 
(Fig.  6),  the  two  strains  are  95%  similar  in  their  16S-tRNADe  spacer  sequences  (Table  3) 
and  have  been  shown  to  be  quite  similar  physiologically  (Moore  and  Chisholm  1999). 
These  same  clusters  are  apparent  in  analyses  using  a  larger  subset  (146  bp)  of  the  16S- 
tRNAne  spacer  (Fig.  7). 


73 


Table  3.  Fractional  similarities  for  the  16S-tRNAIle  spacer 
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Figure  6.  Evolutionary  relationships  of  Prochlorococcus  and  Synechococcus  isolates 
inferred  using  103  positions  of  the  16S-  tRNAIle  spacer.  The  phylogenetic  framework 
was  determined  using  paralinear  distances  (logdet)  and  minimum  evolution  as  the 
objective  function.  Trees  inferred  using  HKY85  distances  resulted  in  identical  clusters 
with  essentially  similar  branching  orders.  Bootstrap  values  from  [distance/maximum 
parsimony/maximum  likelihood]  analyses  are  listed  to  the  left  of  each  node,  with 
values  less  than  50  not  shown.  The  tree  is  arbitrarily  rooted  with  WH8101. 
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The  expanded  subset  of  the  16S-tRNAIle  spacer  allowed  the  designation  of  strains  of 
marine  A  Synechococcus  are  in  four  different  clusters  (Fig.  7).  One,  designated  cluster  I, 
consists  of  the  low  PUB  strains  WH6501,  WH8002,  WH8012  and  WH8109  (Fig.  7). 
These  strains  are  very  similar  in  their  spacer  sequence  (96-99%)  (Table  3).  A  second 
well  supported  cluster  (II)  consists  of  the  motile  high  PUB  strains  WH8103,  WH81 12, 
and  WH81 13.  These  strains  also  show  very  high  sequence  similarity  (99%).  A  third 
cluster,  more  physiologically  and  genetically  divergent  than  the  first  two,  consists  of  the 
strain  WH7805,  which  lacks  PUB  (and  has  an  identical  sequence  to  strains  WH8008  and 
WH8018,  which  also  lack  PUB)  and  the  low  PUB  strains  WH7803  and  WH8017.  Strains 
WH7805  and  WH8017  differ  by  only  1%  over  the  16S-tRNAne  spacer,  while  WH7803  is 
94  %  similar  to  these  two  strains.  Finally,  Synechococcus  cluster  IV  contains  strains 
WH9908  and  WH8016  (only  2%  different),  which  form  a  well  supported  cluster, 
although  little  can  be  said  about  the  physiological  characteristics  which  may  define  this 
group.  WH8016  is  a  low  PUB  strain,  but  pigment  data  is  not  yet  available  for  the 
psychrophilic  strain  WH9908  which  was  isolated  in  Woods  Hole  when  water 
temperatures  were  below  10°  and  has  been  maintained  at  15°C  and  shows  no  growth  at 
23°C  (M.  Sullivan,  personal  communication). 

Although  the  expanded  dataset  of  146  positions  was  used  to  designate  strain  clusters  of 
high  B/A  Prochlorococcus  and  Synechococcus  because  it  offers  greater  resolution,  some 
of  these  clusters  are  also  apparent  on  the  tree  constructed  using  only  103  positions  (Fig.  6 
and  7).  For  example  high  B/A  clusters  I  and  IV  are  well  supported,  as  is  the  monophyly 
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Figure  7.  Evolutionary  relationships  of  high  B/A  Prochlorococcus  and  Synechococcus 
isolates  inferred  using  146  positions  of  the  16S-  tRNAlle  spacer.  The  phylogenetic 
framework  was  determined  using  paralinear  distances  (logdet)  and  minimum  evolution 
as  the  objective  function.  Trees  inferred  using  HKY85  distances  resulted  in  identical 
clusters  with  essentially  similar  branching  orders.  Bootstrap  values  from 
[distance/maximum  parsimony/maximum  likelihood]  analyses  are  listed  to  the  left  of 
each  node,  with  values  less  than  50  not  shown.  The  tree  is  rooted  with  WH8101. 
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of  the  Prochlorococcus  clade  as  a  whole,  although  it  is  less  well  supported  by  bootstrap 
values  with  the  smaller  dataset.  Synechococcus  cluster  II  is  also  supported  with  the 
smaller  dataset,  while  the  two  strains  in  Synechococcus  cluster  IV  are  identical  to  one 
another  over  these  103 bp,  as  are  the  four  strains  in  Synechococcus  cluster  I. 

The  marine  B  Synechococcus  isolate  WH8101  is  80-88%  similar  in  the  sequence  of  the 
16S-tRNA1Ic  spacer  to  the  marine  A  isolates,  approximately  as  distinct  as  each  of  the 
marine  A  Synechococcus  clusters  are  to  one  another  (Table  3).  This  supports  the 
suggestion  that  the  lineages  of  marine  A  Synechococcus  are  of  equivalent  taxonomic  rank 
to  the  division  between  marine  A  and  B  clusters  (Urbach,  et  al.  1998). 

DISCUSSION 

The  higher  variability  of  the  ITS  compared  to  the  16S  rDNA  has  enabled  the 
identification  of  sequence  similarity  clusters  among  the  closely  related  isolates  of 
Prochlorococcus  and  Synechococcus.  Given  the  phylogenetic  divergence  of  the  high 
B/A  isolates  into  four  clusters,  the  question  arises  whether  the  designation  high  and  low 
B/A  ecotypes  is  still  valid  for  Prochlorococcus.  Recently,  Rippka  et  al.  (2000)  have 
proposed  a  subspecies  pastoris  to  describe  low  B/A  Prochlorococcus.  The  proposed  type 
strain  PCC951 1  has  a  16S  rDNA  sequence  identical  to  the  well  characterized  low  B/A 
isolate  MED4  (Rippka,  et  al.  2000).  There  are  undoubtedly  more  taxonomic  distinctions 
to  be  made,  as  each  of  the  high  B/A  clusters  is  a  lineage  of  equal  rank  to  the  low  B/A 
ecotype.  However,  in  the  absence  of  more  striking  ecologically  relevant  physiological 
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differences  among  the  high  B/A  isolates,  the  designations  high  and  low  B/A  ecotype 
remain  useful.  They  accurately  describe  culture  physiologies  which  are  correlated  with 
phylogenetic  relationships  and  are  convenient  terms,  to  be  used  with  the  caveat  that  the 
high  B/A  ecotype  consists  of  a  wider  range  of  physiologies  and  genetic  diversity  than  the 
low  B/A  ecotype. 


Physiological  characterization  of  15  Prochlorococcus  isolates  by  measuring  their  chi  b/a2 
ratio  at  one  light  level  enabled  the  assignment  of  strains  NATL1A  and  NATL2A  to  the 
high  B/A  ecotype  and  the  remaining  thirteen  isolates  to  the  low  B/A  ecotype.  These 
designations  are  supported  by  the  phylogenetic  relationships  of  the  isolates  using  the  ITS 
sequences.  The  chi  b/a2  ratios  of  NATL  1 A  and  NATL2A  are  on  the  low  end  of  values 
observed  for  high  B/A  isolates  of  Prochlorococcus.  However,  values  determined  here 
using  a  spectrophotometer  may  be  underestimates  of  chi  b/ci2  compared  to  values  derived 
from  HPLC  methods  as  employed  by  Moore  and  Chisholm  (1999).  Others  have  found 
that  estimates  of  chi  b/a2  for  the  same  culture  measured  by  spectrofluorometry  were  70% 
of  those  determined  using  HPLC  (Morel,  et  al.  1993).  The  chi  b/a2  ratio  of  NATL1A  is 
similar  to  the  HPLC-determined  value  of  0.95  (when  grown  under  7.5  pmol  Q  m'2  s'1 
white  light)  first  reported  for  this  isolate  (Partensky,  et  al.  1993).  Additionally,  the  range 
of  chi  b/a2  ratios  for  the  four  high  B/A  isolates  is  broader  than  for  the  low  B/A  isolates. 
SSI 20  and  M1T921 1  have  much  higher  ratios  (0.7  -  2.5)  over  all  irradiances  than 
MIT9303  and  MIT9313  (0.5  -  1.3)  (Moore  and  Chisholm  1999).  This  physiological 
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diversity  is  consistent  with  greater  genetic  diversity  among  high  B/A  isolates  seen 
previously  using  16S  rDNA  sequences  (Rocap,  et  al.  1999)  and  here  with  ITS  sequences. 

With  the  addition  of  physiological  data  for  these  15  isolates,  it  is  clear  that  the  majority  of 
the  Prochlorococcus  isolates  in  culture  collections  are  of  the  low  B/A  ecotype  (19  as 
compared  to  only  6  high  B/A  isolates,  of  the  25  physiologically  characterized  strains  in 
this  study).  This  is  in  spite  of  concentrated  efforts  to  culture  additional  high  B/A  isolates 
by  sampling  at  depth,  where  they  are  presumed  to  predominate,  and  then  mimicking  the 
in  situ  light  regime.  Although  it  is  possible  that  low  B/A  Prochlorococcus  are  actually 
numerically  dominant  in  many  parts  of  the  ocean  and  our  collection  of  isolates  represents 
their  natural  frequencies,  this  is  seems  unlikely  given  the  potential  sources  of  isolation 
bias,  which  could  select  for  low  B/A  strains.  For  example,  filtration  steps  necessary  to 
eliminate  the  larger  Synechococcus  could  also  preferentially  remove  larger 
Prochlorococcus  similar  to  the  high  B/A  cluster  IV  isolates  MIT9303  and  MIT9313, 
which  exhibit  much  higher  forward  angle  light  scatter  (a  parameter  related  to  cell  size) 
than  other  isolates  of  Prochlorococcus  (Moore  1997).  It  is  noteworthy  that  both  of  these 
isolates  were  obtained  by  flow  cytometric  sorting,  not  filtration.  Additionally,  culture 
media  may  favor  low  B/A  Prochlorococcus  (possibly  because  of  differing  trace  metal 
requirements  or  tolerances)  and  they  may  overtake  mixed  cultures. 

The  observation  of  multiple  isolates  with  identical  ITS  sequences  underscores  the 
cosmopolitan  nature  of  these  genera  and  is  consistent  with  previous  studies  which 
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identified  Prochlorococcus  isolates  with  identical  16S  rDNA  sequences  (MIT9202,  SB 
and  TATL2)  from  disparate  regions  of  the  worlds  oceans  (Urbach,  et  al.  1998,  Rocap,  et 
al.  1999).  At  the  more  variable  ITS,  Prochlorococcus  MIT9321  and  MIT9322,  isolated 
within  days  of  each  other  in  the  Equatorial  Pacific,  are  also  identical  to  MIT9401  from 
the  Sargasso  Sea.  Similarly  Synechococcus  strains  WH8008,  WH8018  and  WH7805 
isolated  from  the  Gulf  of  Mexico,  Woods  Hole,  and  the  Sargasso  Sea  respectively,  have 
identical  ITS  sequences. 


Of  course,  the  sensitivity  of  the  PCR  to  even  minute  amounts  of  DNA  means  that  caution 
is  warranted  in  deeming  sequences  identical,  especially  in  a  situation  where  the  same  set 
of  primers  is  used  repeatedly  on  different  templates.  However,  several  lines  of  evidence 
suggest  that  the  identical  sequences  are  features  of  the  isolates  and  not  due  to 
contamination.  First,  because  the  genomic  DNA  from  cultures  being  investigated  was  in 
plentiful  supply,  PCR  reactions  employed  generous  amounts  of  template  DNA.  Second, 
reactions  lacking  template  were  always  run  as  a  control  and  did  not  result  in  amplification 
products  as  judged  by  agarose  gels.  Third,  many  of  the  identical  sequences  were 
prepared  several  months  apart  from  one  another  and  other  cultures  were  amplified  in  the 
interim  which  yielded  different  sequences,  suggesting  there  was  not  a  rampant 
contamination  problem.  Finally,  it  is  noteworthy  that  in  no  case  did  a  culture  yield  a 
sequence  which  was  incongruent  with  its  physiology,  as  might  be  expected  to  occur  if  the 
identical  sequences  were  the  result  of  chance  contamination. 
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The  monophyly  of  low  B/A  Prochlorococcus  demonstrated  here  with  the  ITS  (Fig.  6)  is 
consistent  with  previous  results  from  16S  rDNA  and  other  protein  coding  genes  (Urbach, 
et  al.  1998).  The  subdivision  of  the  low  B/A  ecotype  is  also  consistent  with  16S  rDNA 
analyses  in  which  a  well  supported  branching  pattern  separates  MED4  from  other  low 
B/A  isolates  (Rocap,  et  al.  1999).  Although  no  significant  physiological  differences  have 
been  observed  between  isolates  of  these  two  clusters,  they  may  have  differential 
distributions  in  the  ocean,  supporting  the  contention  that  sequence  similarity  clusters  as 
defined  by  the  ITS  correspond  to  ecologically  distinct  populations.  In  surface  waters  in 
the  North  Atlantic  MED4-like  Prochlorococcus  sequences  were  detected  by  using  probes 
to  amplified  16S  rDNA  while  the  low  B/A  cluster  II  could  not  be  detected  (West  and 
Scanlan  1999). 

High  B/A  Prochlorococcus  are  distributed  in  four  clusters,  each  only  about  80%  similar 
to  one  another.  This  greater  genetic  divergence  among  the  high  B/A  Prochlorococcus  is 
consistent  with  the  larger  differences  observed  in  their  physiologies.  For  example,  in 
addition  to  the  higher  chi  b/a2  ratio  of  SS120  and  MIT921 1  and  the  larger  cell  size  of 
MIT9303  and  MIT9313  already  described  (Moore  1997,  Moore  and  Chisholm  1999), 
MIT9303  and  MIT9313  also  possess  an  unidentified  pigment,  possibly  an  epoxide  of 
isocryptoxanthin  (Moore  1997).  Further,  when  grown  under  high  light  MIT9303  and 
M1T9313  have  higher  cell-specific  light  harvesting  efficiencies  and  maximum 
photosynthetic  rates  than  SS120  and  MIT921 1  (Moore  and  Chisholm  1999).  As  with  the 
two  low  B/A  clusters,  there  is  some  evidence  that  the  high  B/A  clusters  may  have 
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different  distributions  in  the  oceans.  In  a  North  Atlantic  water  column  sequences 
corresponding  to  the  NATL1A/NATL2A  type  (high  B/A  cluster  I)  were  detected  by 
probing  PCR  amplified  16S  rDNA,  while  binding  to  probes  specific  for  SSI 20  and 
MIT9303  was  not  detected  (West  and  Scanlan  1999). 

Marine  A  Synechococcus  encompass  at  least  four  sequence  similarity  clusters.  One  of 
these,  cluster  II  can  be  associated  with  a  physiological  trait  as  all  of  its  member  strains 
are  capable  of  swimming.  Motile  Synechococcus  very  likely  occupy  a  unique  niche 
because  the  ability  to  migrate  towards  patches  of  nitrogenous  compounds  (e.g.,  on  marine 
snow)  allows  them  to  access  nutrient  sources  other  Synechococcus  and  Prochlorococcus 
cannot  (Willey  and  Waterbury  1989).  This  cluster  is  also  consistent  with  analyses  using 
rpoCl  in  which  motile  isolates  with  a  range  of  PUB  types  form  a  single  clade  (Toledo,  et 
al.  1999).  Analyses  using  rpoCl  also  affiliate  cluster  m  isolates  WH7803  and  WH7805 
(Toledo  and  Palenik  1997).  Unfortunately  it  is  not  yet  possible  to  completely  compare 
the  clusters  identified  using  the  ITS  with  those  delineated  by  rpoCl  because  the  majority 
of  the  strains  used  in  the  two  studies  are  different.  More  strains  need  to  be  examined 
using  both  rpoCl  and  ITS  sequences  to  make  definitive  conclusions  about  correlations  of 
sequence  clusters  and  physiological  (particularly  pigment)  type. 

The  branching  order  of  the  low  B/A  Prochlorococcus  clade  with  respect  to  the  high  B/A 
clusters  and  Synechococcus  suggests  that  the  low  B/A  clade  is  more  recently  arisen.  If 
the  ability  to  synthesize  divinyl  chlorophylls  a  and  b  has  been  acquired  only  once  (i.e.  if 
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the  root  of  the  tree  is  outside  the  Prochlorococcus  clade,  which  is  reasonable  to  assume) 
then  the  general  possession  of  divinyl  chlorophyll  a  and  b  is  a  primitive  state  shared  by 
all  Prochlorococcus,  and  the  low  B/A  clade  is  a  derived  state.  This  is  consistent  with  a 
possible  evolutionary  scenario  in  which  a  phycobilisome  containing  ancestor  acquired  the 
ability  to  synthesize  divinyl  chlorophylls  a  and  b,  allowing  it  to  colonize  the  deep 
euphotic  zone,  where  competition  for  nutrients  is  reduced,  as  no  other  photosynthetic 
organisms  can  thrive  there.  Then  later  modifications,  including  the  ability  to  survive  in 
higher  light  and  higher  copper  concentrations  characteristic  of  surface  waters,  enabled  the 
low  B/A  clade  to  be  more  competitive  throughout  the  water  column. 

CONCLUSIONS 

Using  the  sequences  of  the  16S  23S  rDNA  internal  transcribed  spacer  region  we  have 
successfully  delineated  these  two  genera  of  marine  cyanobacteria  into  strain  clusters 
which  likely  represent  ecological  units.  For  some  of  the  clusters,  for  example  the  low 
B/A  Prochlorococcus,  some  inferences  can  be  made  about  the  nature  of  the  niche  these 
organisms  are  occupying  (here,  high  light  surface  waters).  In  other  cases,  as  with  some 
of  the  Synechococcus  clusters,  there  is  not  an  obvious  hypothesis  to  explain  the  different 
niches  these  lineages  may  be  occupying.  The  contribution  of  this  work  is  in  identifying 
the  sequence  similarity  clusters  which  potentially  correspond  to  ecologically  distinct 
units.  This  allows  for  more  informed  selection  of  strains  in  laboratory  experiments,  as 
representative  strains  from  each  lineage  can  now  be  employed  in  further  physiological 
studies  to  discover  the  features  of  each  cluster  which  may  lead  to  niche  differentiation. 
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The  examination  of  sequences  directly  from  the  environment,  thus  avoiding  culturing 
biases,  has  been  widespread  in  recent  years  and  has  given  new  insights  into  the  diversity 
of  environmental  bacteria  (Giovannoni,  et  al.  1990,  Ward,  et  al.  1990)  including  marine 
cyanobacteria  (Palenik  1994,  Ferris  and  Palenik  1998,  Urbach  and  Chisholm  1998). 
Sequence  clusters  identified  by  the  ITS  are  excellent  candidates  for  direct  sequence 
diversity  studies  of  field  populations  of  marine  cyanobacteria  because  the  ITS  is  variable 
enough  to  differentiate  the  ecotypes  unambiguously.  Further,  the  sequence  data 
presented  here  allows  the  design  of  oligonucleotides  specific  for  each  cluster  which  can 
be  used  as  probes  in  quantitative  or  in  situ  hybridizations  (Giovannoni,  et  al.  1988, 
DeLong,  et  al.  1989)  or  primers  in  quantitative  PCR  (Lee,  et  al.  1996,  Tay,  et  al.  2000). 
Using  these  tools,  the  distribution  and  relative  abundances  of  each  cluster  can  be  studied 
under  different  environmental  conditions  (Field,  et  al.  1997),  ultimately  providing  a  better 
understanding  of  the  forces  affecting  the  evolution  and  population  dynamics  of  this 
globally  successful  clade  of  cyanobacteria. 
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CHAPTER  FIVE 

Genetic  diversity  of  Prochlorococcus  and  Synechococcus  populations  in  the 

Sargasso  Sea 
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ABSTRACT 


The  diversity  of  Sargasso  Sea  cyanobacterial  populations  was  examined  in  summer  and 
winter  by  restriction  length  polymorphism  and  sequencing  analysis  of  the  16S/23S 
ribosomal  DNA  internal  transcribed  spacer  (ITS).  Cyanobacterial  ITS  sequences  were 
amplified  from  total  community  genomic  DNA  in  three  different  samples,  collected  to 
represent  different  temperature  and  nutrient  regimes.  PCR  products  were  cloned  and 
screened  with  two  tetrameric  restriction  enzymes.  A  combined  total  of  152  clones  from 
the  three  clone  libraries  yielded  19  different  RFLP  types.  ITS  sequences  from 
representatives  of  each  RFLP  type  were  compared  to  sequences  from  Prochlorococcus 
and  Synechococcus  cultures  and  all  but  one  clone  could  be  assigned  to  one  of  seven 
ecotypic  clusters.  The  single  leftover  clone  is  has  a  relatively  low  sequence  similarity 
(-60%)  to  other  environmental  or  cultured  sequences  of  marine  cyanobacteria.  A  larger 
number  of  Prochlorococcus  clones  were  recovered  compared  to  Synechococcus  than 
would  be  expected  from  the  relative  cell  abundances  of  each  genus  in  the  original  field 
samples. 


INTRODUCTION 

Primary  production  in  the  open  ocean  is  dominated  by  the  unicellular  cyanobacteria 
Prochlorococcus  and  Synechococcus.  Although  closely  related  phylogenetically  (Palenik 
and  Haselkom  1992,  Urbach,  et  al.  1992),  they  differ  in  their  cell  sizes  and  pigment 
complements,  enabling  them  to  be  easily  distinguished  and  enumerated  by  flow 
cytometry  (Olson,  et  al.  1990a,  Olson,  et  al.  1990b).  Together  Prochlorococcus  and 
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Synechococcus  comprise  up  to  50%  of  carbon  fixation  in  oligotrophic  waters  (Campbell 
and  Vaulot  1993,  Goericke  and  Repeta  1993,  Veldhuis,  et  al.  1993,  Binder,  et  al.  1996). 
Isolates  of  Prochlorococcus  and  Synechococcus  with  different  photosynthetic  capacities 
have  been  shown  to  coexist  in  natural  samples  (Toledo  and  Palenik  1997,  Moore,  et  al. 
1998).  Knowledge  of  the  genetic  diversity,  and  corresponding  physiological  diversity,  of 
these  important  photoautotrophs  will  provide  a  better  understanding  of  the  processes  that 
influence  rates  of  photosynthesis  in  the  ocean. 

Prochlorococcus  isolates  can  be  divided  into  two  physiologically  and  genetically  distinct 
groups,  referred  to  as  ecotypes  because  their  differing  physiologies  have  implications  for 
their  ecological  distributions.  The  primary  differentiator  of  the  ecotypes  is  their  chi  b/a2 
ratio,  although  they  also  differ  in  their  light-dependent  physiological  responses  (growth 
rate,  pigment  content,  and  photosynthetic  rate  as  a  function  of  light  intensity).  Isolates 
with  a  high  chi  b/a2  ratio  (high  B/A  ecotype)  are  much  more  efficient  at  utilizing  low 
light  than  those  with  a  low  chi  bla2  ratio  (low  B/A  ecotype),  but  are  incapable  of  growth 
at  higher  irradiances  (Moore  and  Chisholm  1999).  In  addition,  high  B/A  isolates  are 
much  more  sensitive  to  copper  than  isolates  of  the  low  B/A  ecotype  (Mann  2000). 
Analysis  of  16S  rDNA  and  16S  23S  internal  transcribed  spacer  (ITS)  sequences 
demonstrates  that  low  B/A  isolates  are  closely  related  and  form  a  well  supported  clade, 
which  can  be  further  divided  into  two  clusters  (Urbach,  et  al.  1998,  Rocap,  et  al.  1999). 

In  contrast,  the  high  B/A  isolates  are  divided  among  of  four  clusters  which  are  less 
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similar  in  their  ITS  sequence  and  do  not  form  a  monophyletic  clade  (Rocap,  et  al.  1999, 
Chapter  4). 

Marine  Synechococcus  strains  are  also  physiologically  and  genetically  diverse. 
Synechococcus  can  be  distinguished  based  on  the  pigment  composition  of  their  light 
harvesting  complexes,  as  isolates  vary  in  the  amounts  of  the  chromophores  phycourobilin 
(PUB)  and  phycoerythrobilin  (PEB)  which  attach  to  the  phycobiliprotein  phyocerythrin 
(Waterbury,  et  al.  1986).  Strains  also  exhibit  other  physiological  differences  in 
swimming  motility,  nutrient  utilization  and  cell  cycle  behavior  (Waterbury,  et  al.  1986, 
Binder  and  Chisholm  1995).  Investigations  into  the  genetic  diversity  of  Synechococcus 
using  DNA-dependent  RNA  polymerase  gene  sequences  ( rpoCl )  as  well  as  the  ITS  have 
identified  numerous  strain  clusters  (Toledo  and  Palenik,  1997,  Toledo  et  al.  1999, 

Chapter  4).  However  the  relationship  of  these  strain  clusters  to  physiological  diversity  is 
not  completely  clear,  except  in  the  case  of  one  cluster  which  contains  all  the  motile 
isolates  examined  to  date  (Toledo  and  Palenik,  1997,  Toledo  et  al.  1999,  Chapter  4). 

A  major  issue  in  extrapolating  physiological  properties  of  cultured  isolates  to  natural 
populations  is  whether  the  strains  represent  dominant  members  of  environmental 
communities.  Sequence  diversity  has  been  assessed  in  field  populations  of  marine 
cyanobacteria  using  rpoCl  sequences  (Palenik  1994,  Ferris  and  Palenik  1998)  and  in 
Prochlorococcus  specifically  using  16S  rDNA  (West  and  Scanlan  1999)  and  the 
intergenic  region  between  the  photosynthetic  electron  transport  genes  petB  and  petD 
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( petB/D )  (Urbach  and  Chisholm  1998).  PCR  amplified  16S  rDNA  sequences  from  a 
North  Atlantic  water  column  could  all  be  affiliated  with  two  Prochlorococcus  clusters 
(high  B/A  I  and  low  B/A  I)  (West  and  Scanlan  1999).  However,  the  extent  of 
physiological  diversity  represented  by  the  genetic  diversity  found  at  rpoCl  or  petB/D  is 
not  clear,  partly  because  of  the  relatively  few  Prochlorococcus  isolates  sequenced  at 
these  loci,  and  also  because  the  connection  between  physiological  and  genetic  diversity  in 
Synechococcus  is  still  being  established  (Toledo  and  Palenik  1997,  Toledo,  et  al.  1999). 

The  internal  transcribed  spacer  (ITS)  region  between  the  16S  and  23S  rRNAs  has 
recently  been  used  to  describe  the  multiple  ecotypes  of  Prochlorococcus  and 
Synechococcus  (Chapter  4).  The  higher  variability  of  this  region  compared  to  the  16S 
rDNA  allowed  the  resolution  of  ten  sequence  similarity  clusters,  each  of  which  likely 
corresponds  to  an  ecologically  distinct  population.  The  aims  of  this  study  were  to 
characterize  the  sequence  diversity  of  natural  cyanobacterial  populations  at  the  ITS,  using 
the  sequences  from  Prochlorococcus  and  Synechococcus  cultures  as  a  framework  to 
attach  a  phenotype  to  environmental  genotypes. 

MATERIALS  AND  METHODS 

Sampling  locations.  Water  samples  were  collected  on  two  cruises  to  the  Sargasso  Sea 
aboard  the  RV  Oceanus.  PCR  clone  libraries  were  constructed  from  samples  obtained 
from  32°  08’  N,  70°02’  W  at  75  m  on  June  10,  1996  (OC279  station  6  CTD  31)  and  from 
35°48’  N,  61°56’  W  at  15  m  and  150  m  on  January  30,  1997  (OC297  station  5  CTD  12). 
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Water  was  collected  from  Niskin  bottles.  Temperature  and  in  vivo  chlorophyll 
fluorescence  data  were  recorded  by  a  SeaBird  CTD  mounted  to  the  sampling  rosette. 

Nutrient  data  presented  are  from  the  same  cruises  and  stations  but  different  CTDs.  June 
data  is  from  32°  18’  N,  70°  1 T  W  (OC279  station  6  CTD  18)  and  January  data  was 
obtained  from  35°.83  N  and  61°.95  W  (OC297  station  5  CTD  31).  The  collection  and 
measurement  of  inorganic  nitrogen  (nitrate  and  nitrite,  or  N+N)  and  soluble  reactive 
phosphorous  (SRP)  is  described  by  Cavender-Bares  (1999). 

Cell  counts.  Seawater  (1.2  ml)  was  preserved  for  flow  cytometry  in  0.125% 
glutaraldehyde  (final  concentration)  and  stored  in  liquid  nitrogen.  Samples  were 
analyzed  on  a  modified  FACScan  flow  cytometer  (Dusenberry  and  Frankel  1994)  using 
Cyclops  software  (Cicero). 

DNA  isolation.  Immediately  after  collection  between  1-2 1  of  seawater  was  filtered  on  to 
a  0.2  Jim  filter  (nucleopore)  and  stored  in  liquid  nitrogen.  For  DNA  extraction,  filters 
were  thawed,  sliced  in  half,  and  one  half  was  refrozen.  The  half  used  for  extraction  was 
minced  and  added  to  2  ml  lysis  solution  (0.5  M  NaCl,  10  mM  EDTA,  10  mM  Tris  pH 
8.0),  and  0.2  ml  10%  SDS  was  added  dropwise  while  vortexing.  The  sample  was  boiled 
for  2  minutes,  spun  10  min  in  a  microcentrifuge  at  10k  rpm,  and  the  supernatant  was 
transferred  to  new  tubes.  The  sample  was  extracted  twice  with  25:24:1  phenol: 
chloroform:  isoamyl  alcohol  and  once  with  24: 1  chloroform:  isoamyl  alcohol,  and  then 
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0.12  ml  10  M  ammonium  acetate  and  0.75  ml  ice  cold  ethanol  were  added.  DNA  was 
precipitated  at  -20°C  overnight  and  then  spun  10  min  in  a  microfuge  at  4°C,  washed  with 
70%  ethanol,  dried  at  room  temperature  and  resuspended  in  TE. 

PCR  clone  library  construction.  Portions  of  the  16S  and  23S  rDNAs  and  the  entire  ITS 
between  them  (the  ITS/23S  locus)  were  amplified  from  the  three  samples  using  primers 
16S-1247f  (CGTACTACAATGCTACGG)  and  23S-1608r  (CYACCTGTGTCGGTTT). 
Primer  16S-1247f  is  cyanobacterial  specific,  as  determined  using  the  PROBE-CHECK 
function  in  the  newest  release  of  the  RDP  (Maidak,  et  al.  1999).  PCR  reactions  employed 
cycling  parameters  described  in  Chapter  4.  Reactions  were  performed  in  quintuplicate, 
pooled,  and  excess  primers  removed  using  Strataprep  columns  (Stratagene).  An  aliquot 
was  visualized  on  a  1%  agarose  gel  to  ensure  that  only  bands  in  the  2300-2700  bp  size 
range  were  present.  Control  reactions  lacking  template  DNA  were  always  performed  in 
parallel  and  gave  no  products,  as  judged  by  visualization  on  agarose  gels. 

Pooled  purified  PCR  products  were  cloned  into  pCR-Script  vectors  (Stratagene)  and 
transformed  into  Super  competent  cells  (Stratagene)  according  to  the  manufacturer’s 
instructions.  Positive  colonies,  as  judged  by  blue-white  color  selection,  were  picked  and 
struck  on  selective  plates  to  confirm  the  color  phenotype  and  obtain  single  colonies, 
which  were  picked  and  grown  overnight  in  LB  supplemented  with  appropriate  antibiotics. 
These  liquid  cultures  were  used  for  frozen  stocks  (mixed  1 : 1  with  glycerol  storage  buffer 
and  stored  at  -80°C  (Ausubel,  et  al.  1992))  and  for  plasmid  purification  using  the 
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Miniprep-24  (MacConnell  Research).  Plasmids  were  digested  with  Eco  RV  (New 
England  Biolabs),  which  cuts  the  pCR-Script  plasmid  once  (in  the  polylinker).  Plasmids 
with  appropriate  sized  inserts  (roughly  50%)  were  selected  for  further  analysis.  The 
majority  of  inserts  were  not  cut  by  Eco  RV. 

Restriction  fragment  length  polymorphism  (RFLP)  analysis.  Minipreps  (0.1-1  pi) 
were  used  as  templates  to  amplify  the  ITS/23S  from  each  plasmid  using  the  PCR 
conditions  and  primers  described  above,  except  only  single  reactions  were  performed  for 
each  template.  PCR  products  (3-8  pi)  were  independently  digested  in  20  pi  reactions 
with  the  tetrameric  restriction  enzymes  Hae  III  (GG/CC)  and  Rsa  I  (GT/AC)  (New 
England  Biolabs)  according  to  manufacturer’s  instructions.  Digests  were  visualized  on 
2%  agarose  TBE  gels  stained  with  ethidium  bromide.  Fragments  were  sized  using  Eagle 
Eye  software  (Stratagene)  and  only  those  larger  than  200bp  were  considered  in  resolving 
the  RFLP  patterns.  The  combined  results  of  the  two  single  digests  were  used  to 
determine  RFLP  type. 

For  RFLP  analysis  of  cultures,  isolates  of  Prochlorococcus  and  Synechococcus  were 
grown  and  genomic  DNA  was  extracted  as  described  in  Chapter  4.  Amplification  of  the 
ITS/23S  and  determination  of  the  RFLP  type  for  each  culture  was  performed  as  above. 


Sequencing.  A  1000-1300  bp  fragment  spanning  the  ITS  was  sequenced  in  at  least  two 
clones  of  each  RFLP  type  (where  possible)  from  each  sample  (51  clones  total).  PCR 
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products  from  each  plasmid  obtained  as  above  were  sequenced  on  one  strand  using 
primers  16S-1247f  and  23S-241r  (TTCGCTCGCCRCTACT).  Sequencing  reactions 
were  performed  with  ABI  Big  Dye  terminator  sequencing  kits  (PE  Biosystems)  according 
to  the  manufacturer’s  instructions  and  run  on  an  ABB  10  automated  sequencer. 

Sequences  of  five  clones  from  the  Jan.  150  m  clone  library  (47,  59,  144,  145,  150)  whose 
phylogenetic  affiliation  was  initially  uncertain  were  confirmed  by  second  strand 
sequencing  using  primers  ITS-alaf  (TWTAGCTCAGTTGGTAGAG)  and  ITS-alar 
(CTCTACCAACTGAGCTAWA).  Base  calls  in  sequence  chromatagram  files  were 
manually  checked  using  Chromas  v.  1.43  (publicly  available  at 
http://www.technelysium.com.au/chromas.html). 

Phylogenetic  analysis.  Sequences  were  compiled  and  aligned  manually  with  ITS 
sequences  of  cultured  cyanobacteria  (Chapter  4)  using  the  Genetic  Data  Environment 
(Smith,  et  al.  1994).  Phylogenetic  analyses  employed  PAUP*  version  4b2a  (Swofford 
1999).  Distance  trees  were  inferred  using  minimum  evolution  as  the  objective  criterion 
and  paralinear  (logdet)  or  HKY85  distances.  Maximum  likelihood  analyses  used  the 
HKY85  model  of  nucleotide  substitution  with  rate  heterogeneity  and  empirical  nucleotide 
frequencies.  The  gamma  shape  parameter  and  the  transition/transversion  ratio  were 
initially  estimated  from  a  distance  topology  and  refined  by  iterative  likelihood  searches. 
All  heuristic  searches,  including  bootstrap  analyses  (100  resamplings),  utilized  random 
addition  and  tree-bisection  reconnection  (TBR)  branch-swapping  methods.  Phylogenetic 
trees  were  visualized  with  TREEVIEW  (Page  1996). 
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RESULTS 


Sampling  locations.  Seawater  samples  were  collected  on  two  cruises  to  the  Sargasso 
Sea  in  June  1996  and  January  1997.  The  water  column  in  June  exhibited  a  typical 
summer  profile  for  the  Sargasso  Sea  (Olson,  et  al.  1990b)  with  a  shallow  mixed  layer  and 
a  deep  chlorophyll  maximum  (Fig.  1A).  Nutrient  concentrations  were  low  in  surface 
waters  (Fig.  IB)  (Cavender-Bares  1999).  Concentration  of  Synechococcus  were  fairly 
uniform  throughout  the  water  column  while  Prochlorococcus  were  depleted  in  surface 
waters  and  exhibited  a  subsurface  maximum  (Fig.  1C). 

In  contrast,  the  water  column  in  January  exhibited  a  deep  mixed  layer,  typical  for  winter 
in  the  Sargasso  Sea  (Fig.  ID)  (Olson,  et  al.  1990b).  Consistent  with  the  constant 
temperature  over  the  upper  150  m,  nutrient  concentrations  were  relatively  constant  with 
depth.  Nitrogen  concentrations  in  surface  waters  were  100  fold  greater  than  in  the  June 
profile  (Fig.  IE)  (Cavender-Bares  1999).  Cell  numbers  for  both  Prochlorococcus  and 
Synechococcus  were  also  relatively  constant  with  depth.  Prochlorococcus  were  —20% 
more  abundant  than  Synechococcus  throughout  water  column  (Fig.  IF). 

PCR  clone  library  construction.  Three  samples  from  the  water  columns  described 
above  were  used  to  construct  PCR  clone  libraries  (Fig.  1C,  F).  In  June  a  sample  from  75 
m,  where  Prochlorococcus  had  a  maximum  in  cell  numbers,  was  successfully  used  to 
create  a  PCR  clone  library.  Samples  were  also  taken  from  15  m  and  25  m,  but  due  to 
extremely  low  DNA  recovery  from  these  samples,  cyanobacterial  ITS  sequences  could 
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Figure  1.  Water  column  characteristics  at  sampling  locations  from  summer  (A,  B,  C) 
and  winter  (D,  E,  F)  cruises.  (A,  D)  Temperature  and  chlorophyll  with  depth.  (B,  E  ) 
Total  nitrogen  (nitrate  and  nitrite  (N+N))  and  soluble  reactive  phosphorus  (SRP) 
(from  Cavender-Bares,  1999)).  (C,  F)  Total  abundance  of  Prochlorococcus  and 
Synechococcus.  Arrows  indicate  sampling  locations  for  PCR  clone  libraries. 
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not  be  reproducibly  amplified,  and  no  PCR  clone  libraries  were  constructed  from  surface 
waters  of  this  profile.  In  January  samples  from  near  the  surface  (15  m)  and  at  the  base  of 
the  mixed  layer  (150  m)  were  employed  as  material  for  PCR  clone  library  construction. 


The  ITS/23S  locus  was  amplified  from  nucleic  acids  extracted  from  the  three  samples 
using  primers  to  the  16S  and  23S  rDNAs.  Only  the  forward  (16S  rDNA)  primer  was 
cyanobacterial  specific,  but  no  sequences  suspected  to  originate  from  heterotrophic 
bacteria  were  found  in  any  of  the  libraries.  The  PCR  products  were  cloned  and  152 
clones  (48,  32  and  72  from  the  June  75  m,  Jan.  15  m  and  Jan.  150  m  libraries 
respectively)  were  recovered  and  subjected  to  further  analysis. 

Determination  of  RFLP  types.  The  tetrameric  restriction  enzymes  Hae  III  and  Rsa  I 
were  originally  chosen  to  screen  the  environmental  sequences  because  restriction  patterns 
of  ITS/23S  sequences  distinguish  high  B/A  Prochlorococcus  from  low  B/A  isolates  (Fig. 
2).  The  enzymes  are  also  able  to  resolve  the  clusters  within  the  Prochlorococcus 
ecotypes  (Fig.  2),  and  clusters  of  Synechococcus  isolates  (data  not  shown). 

Amplified  ITS  sequences  from  clones  in  the  three  environmental  libraries  were  screened 
independently  with  Hae  HI  and  Rsa  I.  In  several  cases  clones  were  found  with  identical 
restriction  patterns  with  one  of  the  enzymes  and  different  banding  patterns  with  the  other 
enzyme;  these  were  designated  separate  RFLP  types.  For  example,  types  C  and  D  show 
the  same  restriction  pattern  with  Rsa  I  (Fig.  3A)  but  a  different  one  when  Hae  HI  is  used 
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Figure  2.  Restriction  digests  of  cultured  Prochlorococcus  ITS/23S  sequences.  (A) 
Digests  using  Rsa  I.  (B)  Digests  using  Hae  HI. 
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Figure  3.  Restriction  digests  of  the  ITS/23S  in  environmental  clones 
representative  of  each  RFLP  type  found.  (A)  Rsa  I  digests  (B)  Hae  HI  digests 
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(Fig.  3B).  Based  on  the  combined  patterns  of  the  two  single  digests  a  total  of  19  RFLP 
types  were  identified  among  the  three  samples  (Fig.  3).  The  June  75  m  library  contained 
9  RFLP  types,  the  Jan.  15  m  library  yielded  4  types  and  clones  in  the  Jan.  150  m  library 
consisted  of  12  types  (Table  1). 

Diversity  and  distribution  of  RFLP  types.  Of  the  19  RFLP  types  recovered  in  the  three 
libraries,  2  RFLP  types  (A  and  B)  were  found  in  all  three  libraries  and  2  (C  and  E)  were 
found  in  two  of  the  three  libraries  (both  in  the  June  75  m  and  the  Jan.  150  m  libraries) 
(Table  1).  The  distribution  of  clones  among  the  RFLP  types  was  quite  skewed,  with  the  4 
RFLP  types  found  in  more  than  one  library  accounting  for  126  of  the  152  clones 
examined.  The  remaining  15  RFLP  types  were  each  represented  by  4  clones  or  fewer, 
with  9  RFLP  types  found  only  once  (Table  1). 

Seven  of  the  nineteen  environmental  RFLP  types  were  also  found  in  cultures.  The  four 
most  prevalent  types  (A,  B,  C  and  E)  all  had  counterparts  in  cultures  ( Prochlorococcus 
MIT9312,  Synechococcus  WH8012,  Prochlorococcus  NATL2A  and  Prochlorococcus 
MED4,  respectively)  (Table  1).  Patterns  identical  to  Prochlorococcus  NATL1A, 
Synechococcus  WH8103  and  Synechococcus  WH9908  were  also  found  in  the  clone 
library  (RFLP  types  D,  F  and  S)  (Table  1). 
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Table  1.  Clones  assigned  to  each  RFLP  type.  For  each  field  sample  library  clones  were 
numbered  sequentially  in  the  order  they  were  picked  off  solid  media.  Numbers  shown  in 
bold  were  selected  for  sequencing. 


RFLP  type 

June  75  m 

Jan  15  m 

Jan  150  m 

number 

sequenced 

A 

3,6,10,13,15 

26,29,48,52,56, 

4,20,21,26,27,33,34 

11 

16,17,29,33, 

65,67,71,88,90, 

39,41,48,52,62,63, 

35,51,60,62, 

108,112,127, 

66,68,73,77,79,82, 

66 

137,167,171, 

87,90,96,100,104, 

177 

111,118,123,125,12 

7,131,132,143,146, 

148,149,152,155 

B 

4,  26 

3,47,59,93,109, 

15,17,19,28,42,46, 

11 

123,124,125, 

49,101,103,107,109 

134,138,140, 

110,120,121,134, 

141,156 

141,142,151,153 

C 

1,5,7,18,19, 

13,106,124 

5 

30,31,43,44, 

47,  54,58 

D 

37,45,46,61 

3 

E 

34,49,53,55, 

1 

3 

56,57,  59,64 

F 

27,32 

1 

G 

14,28,38 

1 

H 

20,42 

1 

I 

63 

1 

J 

96 

1 

K 

187 

1 

L 

59,144,145 

3 

M 

86,88,112 

3 

N 

47 

1 

O 

10 

1 

P 

56 

1 

Q 

78 

1 

R 

115 

1 

S 

150 

1 

Total 

51 
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In  order  to  assess  how  well  the  clones  screened  represented  the  diversity  present  in  the 
clone  libraries  the  cumulative  numbers  of  RFLP  types  were  plotted  as  they  appeared 
during  sampling  (Fig.  4)  (Moyer,  et  al.  1994).  As  a  library  becomes  more  thoroughly 
sampled,  the  examination  of  more  clones  will  not  result  in  novel  RFLP  types  recovered 
and  the  cumulative  plots  will  reach  a  plateau  corresponding  to  the  actual  number  of  types 
in  the  library.  Since  the  curves  should  asymptotically  approach  the  true  number  of  RFLP 
types  in  the  library  their  shape  is  analogous  to  Michalis-Menten  curve  and  therefore  in  a 
double-reciprocal  plot,  analogous  to  a  Lineweaver-Burke  plot,  the  reciprocal  of  the  y- 
intercept  is  the  total  number  of  RFLP  types  (Polz,  et  al.  1999).  Double  reciprocal  plots  of 
the  data  in  Fig.  2  yield  y-intercept  values  of  8.4,  3.0  and  1 1.6  for  the  June  75  m,  Jan.  15 
m  and  Jan.  150  m  libraries  respectively.  These  are  quite  similar  to  the  observed  values 
(9, 4  and  12)  indicating  that  the  libraries  were  almost  completely  sampled.  It  should  be 
emphasized  that,  because  of  the  biases  that  can  occur  in  steps  used  to  construct  the  library 
(chiefly  the  PCR),  this  coverage  refers  to  the  completeness  of  sampling  the  diversity  of 
clones  in  the  library,  but  cannot  be  extended  to  conclude  that  the  sequence  diversity  in  the 
environment  has  been  fully  examined. 

Phylogenetic  Analysis.  To  confirm  the  close  relationship  of  seven  of  the  RFLP  types  to 
their  respective  cultures  and  to  assign  a  phenotype  to  the  remaining  twelve  RFLP 
patterns,  the  ITS  was  sequenced  in  51  clones  (Table  1).  ITS  sequences  varied  in  length 
from  539  to  81 1  bp,  within  the  range  of  ITS  lengths  in  cultures  of  Prochlorococcus  and 
Synechococcus  (Chapter  4).  Eleven  clones  were  sequenced  of  each  of  the  two  most 
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Number  of  clones  examined 


Figure  4.  Estimated  diversity  of  cyanobacterial  RFLP  types  in  three  clone 
libraries  from  the  Sargasso  Sea.  For  each  library  the  cumulative  number  of  RFLP 
types  found  is  graphed  as  a  function  of  the  number  of  clones  screened. 
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abundant  RFLP  types  (Table  1).  Within  a  single  RFLP  type  sequence  similarities  at  the 
16S-tRNAne  spacer  ranged  from  93-100%. 

Phylogenetic  trees  based  on  103  positions  of  the  16S-tRNADe  spacer  show  that  the 
majority  of  the  clones  fall  into  previously  defined  clusters  based  on  sequences  from 
cultures  (Fig.  5).  The  nineteen  restriction  patterns  were  grouped  in  ecotypes  based  on 
sequence  similarities  and  phylogenetic  position  of  the  sequences  (Table  2).  Seven  of  the 
eight  clusters  have  cultured  members.  In  many  cases  these  groups  were  anticipated  even 
before  sequencing  because  clones  had  identical  RFLP  patterns  with  one  restriction 
enzyme  (Fig.  3). 

Twelve  of  the  RFLP  types  can  be  identified  as  Prochlorococcus.  Nine  of  these  branch 
within  the  low  B/A  Prochlorococcus  clade.  Three  of  the  low  B/A  RFLP  types  (E,  G,  and 
H)  are  members  of  cluster  I  and  the  other  six  (A,  J,  M,  O,  P,  Q)  belong  to  the  low  B/A  II 
cluster  (Table  2,  Fig.  5).  High  B/A  Prochlorococcus  are  represented  in  the  libraries  by 
three  RFLP  types.  Two  of  these  (C  and  D)  share  RFLP  patterns  with  the  high  B/A  I 
cluster  cultures  NATL2A  and  NATL  1  A,  respectively  (Table  2).  A  third  RFLP  type  (I) 
also  branches  within  the  high  B/A  cluster  I  (Fig.  5).  However,  no  clones  were  found  with 
RFLP  patterns  like  isolates  of  the  other  three  high  B/A  Prochlorococcus  clusters  and 
none  of  the  sequenced  clones  had  high  sequence  similarities  to  those  clusters. 
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Figure  5.  Evolutionary  relationships  of  51  environmental  sequences  with  cultured  isolates  of  marine 
cyanobacteria.  Dots  indicate  sequences  from  clone  libraries  on  tree,  more  than  one  dot  on  a  line 
indicates  sequences  identical  at  the  103  positions  of  thel6S-tRNAIle  spacer  used  in  the  analysis. 
Phylogenetic  framework  inferred  using  a  distance  criterion  with  paralinear  (logdet)  distances  and 
minimum  evolution  as  the  objective  function.  Bootstrap  values  from  100  resamplings  are  to  the  left  of 
each  nod,  values  below  50  not  shown.  The  tree  found  using  likelihood  methods  assigned  clones  to  same 
sequence  clusters  and  was  not  significantly  better  as  judged  by  the  Kishino  and  Hasegawa  test.  Trees 
excluding  low  B/A  Prochlorococcus  and  employing  146  positions  of  the  16S-tRNA,le  spacer  resulted  in 
essentially  similar  branching  orders  and  exactly  the  same  cluster  assignments. 
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Table  2.  Distribution  of  RFLP  types  among  ecotypes 


Ecotypic  cluster 

Representative 

culture 

RFLP 

type 

Numbers  of  clones  found 
June  Jan.  Jan. 

75m  15m  150m 

Total 

Prochlorococcus 

MIT9312 

A 

14 

17 

37 

68 

Low  B/A II 

J 

1 

1 

M 

3 

3 

O 

1 

1 

P 

1 

1 

Q 

1 

1 

Prochlorococcus 

MED4 

E 

8 

1 

9 

Low  B/A  I 

G 

3 

3 

H 

2 

2 

Prochlorococcus 

NATL2A 

C 

12 

3 

15 

High  B/A  I 

NATL1A 

D 

4 

4 

I 

1 

1 

Synechococcus  I 

WH8012 

B 

2 

13 

19 

34 

K 

1 

1 

R 

1 

1 

Synechococcus  II 

WH8103 

F 

2 

2 

Synechococcus  HI 

L 

3 

3 

Synechococcus  IV 

WH9908 

S 

1 

1 

Unaffiliated 

N 

1 

1 

Total 

48 

32 

72 

152 

Six  of  the  nineteen  environmental  RFLP  types  could  be  designated  as  Synechococcus 
(Table  2,  Fig.  5).  Three  different  RFLP  types  (B,  K,  and  R)  branched  within 
Synechococcus  cluster  I,  which  consists  of  strains  with  low  PUB  levels  (Fig.  5).  A  fourth 
RFLP  type  (F)  is  shared  by  the  motile  Synechococcus  in  cluster  II.  A  fifth  RFLP  type  (L) 
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branched  within  Synechococcus  cluster  III.  Finally  the  sixth  RFLP  type  (S)  was  identical 
to  that  of  isolates  in  Synechococcus  cluster  IV,  and  sequence  similarity  bore  this  out. 

The  nineteenth  RFLP  type  (N)  could  not  be  affiliated  with  any  cultures.  Clone  number 
47  from  the  Jan.  150  m  sample  is  only  50-60%  similar  at  the  16S-tRNAn®  spacer  to  high 
B/A  Prochlorococcus  and  Synechococcus  (as  compared  to  more  than  90%  similarity 
within  clusters  and  generally  greater  than  75%  similarity  between  clusters).  Although 
phylogenetic  analyses  place  this  sequence  within  the  high  B/A  Prochlorococcus,  this 
branching  pattern  is  not  well  supported.  The  unstable  position  of  this  clone  is  likely  due 
to  its  large  genetic  distance  from  all  of  the  other  sequences  (Swofford,  et  al.  1996). 

The  BLAST  searching  function  of  GenBank  was  used  to  further  explore  the  possible 
identity  of  cloned  sequence  150  m-47.  A  search  using  the  full  ITS  sequence  resulted  in  a 
collection  of  cyanobacterial  and  chloroplast  sequences.  However,  these  same  sequences 
were  also  identified  as  the  best  matches  if  any  of  the  cultured  Prochlorococcus  or 
Synechococcus  sequences  were  utilized  as  input  data  to  the  search.  Thus  clone  150  m-47 
is  likely  cyanobacterial  in  origin,  but  the  specific  hits  reflect  the  limited  number  of 
cyanobacterial  ITS  sequences  in  the  database  rather  than  a  particular  affiliation  for  this 
clone.  If  the  search  was  restricted  to  the  approximately  150  bp  on  the  3’  end  of  the  16S 
rDNA,  then  the  best  match  was  the  16S  rDNA  sequence  from  the  high  B/A 
Prochlorococcus  isolate  SS120,  followed  by  other  Prochlorococcus  and  marine 
Synechococcus  isolates. 
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Biases  in  distribution  of  clones.  Because  of  their  distinctive  pigment  fluorescences 
Prochlorococcus  and  Synechococcus  populations  can  be  enumerated  individually  by  flow 
cytometry,  thus  it  is  possible  to  compare  the  relative  proportions  of  clones  assigned  to 
each  of  the  two  genera  with  the  proportions  of  cells  in  the  sample  each  library  was 
constructed  from.  This  calculation  provides  some  indication  of  bias  in  the  clone  libraries. 
All  three  libraries  over-represent  Prochlorococcus  with  respect  to  Synechococcus  by 
factors  ranging  from  1.1  to  2.3  (Table  3). 


Table  3.  Estimates  of  relative  bias  in  clone  libraries.  Prochlorococcus 
overestimation  factor  was  determined  for  each  sample  by  dividing  the  ratio  of 
Prochlorococcus  to  Synechococcus  (Pro:Syn)  in  the  PCR  clone  libraries  by  the 
ratio  of  Pro:Syn  cell  numbers. 


June  75m 

Jan.  15  m 

Jan.  150  m* 

PCR  clone  libraries 

Prochlorococcus 

44 

18 

48 

Synechococcus 

4 

14 

23 

Pro:Syn  ratio 

11 : 1 

1.3: 1 

2.09  : 1 

Cell  concentrations  (ml'1) 

Prochlorococcus 

73843 

19000 

15400 

Synechococcus 

15700 

15900 

13100 

Pro:Syn  ratio 

4.7:1 

1.19  : 1 

1.17 : 1 

Prochlorococcus 

overestimation  factor 

2.34 

1.1 

1.79 

*Only  71  clones  are  included  in  analysis  of  Jan.  150  m  clone  library  as  one  clone  could  not  be 
confidently  assigned  to  either  Prochlorococcus  or  Synechococcus 
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DISCUSSION 


The  coexistence  of  multiple  cyanobacterial  ecotypes  in  single  water  samples  has  been 
repeatedly  demonstrated  with  both  culture  and  sequence  data  (Toledo  and  Palenik  1997, 
Ferris  and  Palenik  1998,  Moore,  et  al.  1998,  Urbach  and  Chisholm  1998).  Here,  of  eight 
clusters  found,  two  were  found  in  all  three  libraries  and  two  more  were  found  in  two  of 
the  three.  These  results  are  consistent  with  analysis  of  petB/D  intergenic  region 
sequences  in  the  Sargasso  Sea  and  Gulf  Stream  in  which  overlapping  alleles  were  found 
in  multiple  libraries  from  a  depth  profile  (Urbach  and  Chisholm  1998).  This  was 
interpreted  as  populations  in  the  water  column  sharing  a  common  gene  pool,  with 
different  alleles  potentially  changing  in  numeric  abundance  as  conditions  changed.  A 
similar  situation  may  exist  over  the  course  of  a  year.  It  is  likely  that  several  ecotypes  are 
present  all  year  round,  and  thus  are  available  to  be  detected  using  PCR-based  methods. 
However,  the  relative  abundances  may  change  throughout  the  year  as  conditions  change 
in  the  water  column. 

The  majority  of  sequences  in  this  study  were  related  to  an  isolate  in  culture  (more  than 
90%  similar  at  the  16S-tRNAIle  spacer).  The  high  proportion  of  sequences  related  to 
isolates  raises  concerns  about  the  possibility  that  the  environmental  PCR  samples  could 
have  been  contaminated  with  culture  DNA  (especially  already  amplified  ITS  sequences) 
existing  in  the  lab.  However,  although  a  few  environmental  sequences  are  identical  to 
those  of  cultures,  the  majority  are  merely  quite  similar,  which  would  not  be  expected  if 
they  were  contaminants.  In  theory  these  small  differences  could  be  the  result  of 
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microheterogeneity  at  multiple  copies  of  the  ITS  in  a  single  organism,  which  may  not 
have  been  observed  in  sequences  obtained  from  PCR  products  in  Chapter  4.  However, 
preliminary  data  from  the  complete  genome  of  Prochlorococcus  MED4 
(http://bbrp.llnl.gov/jgi/microbial/prochlorococcus_homepage.html)  and  from  southern 
blotting  of  a  suite  of  isolates  (Urbach,  et  al.  1998,  West  and  Scanlan  1999)  indicates  that 
Prochlorococcus  has  only  one  ribosomal  RNA  operon.  Further,  contamination  as  a 
source  can  be  absolutely  ruled  out  for  all  of  the  Synechococcus  sequences  found,  as  all 
three  clone  libraries  were  constructed  before  any  of  the  Synechococcus  strains  described 
in  Chapter  4  were  present  in  the  lab. 

One  clone  (150  m-47)  was  not  at  all  closely  related  to  any  available  sequences  from 
cultures.  Cyanobacterial  sequences  of  uncertain  phylogenetic  affiliation  have  been 
observed  previously  in  environmental  clone  libraries.  The  branching  pattern  of  SAR7,  a 
16S  rDNA  sequence  from  the  Sargasso  Sea  (Giovannoni,  et  al.  1990),  cannot  be 
confidently  determined  within  the  marine  picoplankton  clade  (Urbach,  et  al.  1998,  Rocap, 
et  al.  1999).  However,  SAR7  is  very  likely  either  a  high  B/A  Prochlorococcus  or  a 
Synechococcus,  while  clone  150  m-47  is  different  enough  from  both  Prochlorococcus 
and  Synechococcus  sequences  to  suggest  it  may  represent  a  different  genus  altogether. 
Although  Prochlorococcus  and  Synechococcus  are  by  far  the  numerically  dominant 
cyanobacteria  in  the  open  ocean,  they  are  not  the  only  genera  present.  In  the  Pacific 
larger  (~2-3  pm)  orange  fluorescing  cells  have  been  observed  and  tentatively  identified  as 
Synechocystis  (Campbell,  et  al.  1997,  Neveux,  et  al.  1999).  It  is  certainly  possible  for  the 
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ITS  from  such  an  organism  to  have  been  amplified  in  this  study,  as  the  primer  16S-1247f 
was  designed  to  exclude  heterotrophic  bacterial  sequences,  but  is  not  specific  to  the 
Prochlorococcus/ marine  Synechococcus  clade. 

The  overrepresentation  of  Prochlorococcus  relative  to  Synechococcus  in  the  clone 
numbers  as  compared  to  the  cell  numbers  is  evidence  that  biases  existed  somewhere  in 
the  process  of  turning  cells  into  PCR  clone  libraries.  Isolation  of  DNA  is  one  step  where 
biases  may  have  occurred,  as  Prochlorococcus  and  Synechococcus  may  have  different 
DNA  extraction  efficiencies  for  the  protocol  employed.  The  PCR  is  another  likely 
candidate  for  introducing  bias  (Suzuki  and  Giovannoni  1996,  Polz  and  Cavanaugh  1998), 
although  attempts  were  made  to  avoid  conditions  known  to  induce  PCR  bias  where 
possible.  Multiple  PCR  reactions  were  pooled  before  cloning  to  reduce  the  influence  of 
early  random  events  (Polz  and  Cavanaugh  1998).  The  number  of  cycles  was  limited  to 
the  amount  (30)  where  PCR  products  were  barely  visible  on  agarose  gels.  However, 
other  strategies  such  as  starting  with  high  template  concentrations  in  reactions  and 
avoiding  degenerate  primers  (Polz  and  Cavanaugh  1998)  were  not  feasible  because  of 
low  recovery  of  DNA  from  environmental  samples  and  the  limited  database  of 
cyanobacterial  23S  rDNA  sequences  available.  The  template  of  interest  may  also  be  an 
issue,  as  the  differences  in  ITS  length  and  %GC  content  between  the  two  genera  could 
lead  to  differential  amplification,  specifically  preferential  amplification  of  shorter 
templates,  such  as  low  B/A  Prochlorococcus  (Mutter  and  Boynton  1995).  This  appears 
to  be  a  likely  explanation  for  the  results  observed  here,  although  multiple  factors  may 
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have  contributed  to  the  overrepresentation  of  Prochlorococcus  in  the  PCR  clone  libraries. 
Similar  biases  undoubtedly  exist  within  each  genus  (i.e.  among  the  different  ecotypes  of 
Prochlorococcus  and  Synechococcus ). 

Because  of  the  demonstrated  and  potential  biases,  information  on  the  relative  distribution 
of  the  ecotypes  must  be  garnered  using  quantitative  methods  such  as  probing  (Field,  et  al. 
1997)  or  Q-PCR  (Lee,  et  al.  1996,  Tay,  et  al.  2000).  The  discrete  number  of  clusters 
identified  by  the  ITS  will  be  advantageous  for  the  design  of  ecotype  specific 
oligonucleotides  to  quantitate  abundances  (see  Chapter  6).  The  ultimate  test  of  a 
sequence  similarity  cluster  as  an  ecologically  discrete  population  is  if  it  has  a  unique 
distribution  in  time  and  space  (Palys,  et  al.  1997).  Thus,  the  ability  to  measure  the 
abundances  of  each  cluster  could  confirm  that  they  represent  ecologically  distinct 
organisms  as  well  as  provide  insights  into  factors  that  may  regulate  their  distributions. 

CONCLUSIONS 

This  study  demonstrates  the  utility  of  the  ITS  for  species  level  diversity  studies,  as 
environmental  ITS  sequences  could  be  readily  grouped  into  sequence  clusters. 

Consistent  with  previous  studies  demonstrating  that  it  is  not  possible  to  make  quantitative 
conclusions  about  genotype  distributions  on  the  basis  of  frequencies  in  clone  libraries,  the 
number  of  clones  found  was  skewed  towards  Prochlorococcus.  However,  despite  the 
presumably  different  set  of  biases  associated  with  culturing  environmental 
microorganisms,  the  overwhelming  majority  of  sequences  found  were  closely  related  to 
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those  of  cultures.  Confirmation  of  this  result  and  exploration  of  the  potential  different 
distributions  of  the  ecotypes  awaits  the  development  of  quantitative  methods  for 
determining  the  abundances  of  the  ecotypes  of  Prochlorococcus  and  Synechococcus. 
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CHAPTER  SIX 


Future  Directions 
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A  major  contribution  of  this  thesis  is  the  identification  of  sequence  similarity  clusters  of 
Prochlorococcus  and  Synechococcus  that  may  correspond  to  ecologically  distinct 
populations.  However,  the  ecologically  relevant  physiological  differences  between  the 
strain  clusters  that  may  allow  them  to  occupy  unique  niches  are  not  clear  in  all  cases. 
Further  laboratory  work,  using  representative  strains  from  each  cluster,  will  be  required 
to  determine  the  potential  roles  of  nutrients,  metals,  temperature  and  other  factors  in 
ecotypic  differentiation.  Several  experiments  of  this  sort,  including  the  exploration  of 
potential  ecotypic  differences  in  the  utilization  of  organic  nutrients,  in  requirement  of 
cobalt  for  growth  and  in  the  expression  of  phycoerythrin  are  currently  underway. 

Although  much  discussion  has  focused  on  factors  that  may  differentially  regulate  the 
growth  of  the  ecotypes,  differential  mortality  may  also  be  an  important  determinant  of 
distributions.  Viruses  are  known  to  influence  distributions  of  Synechococcus,  as  natural 
populations  are  resistant  to  their  co-occurring  phages  (Waterbury  and  Valois  1993).  The 
same  is  likely  true  for  Prochlorococcus,  although  the  search  for  viruses  which  infect 
Prochlorococcus  is  still  underway.  Selective  grazing  may  also  be  an  important 
phenomenon  in  the  wild  (Christaki,  et  al.  1999).  As  little  is  known  about  the  specific 
interactions  of  marine  cyanobacteria  with  their  grazers,  this  area  is  ripe  for  exploration. 

The  ten  ecotypic  clusters  of  Prochlorococcus  and  Synechococcus  identified  in  chapter  4 
may  not  be  an  exhaustive  set,  and  the  existence  additional  clusters  should  be  actively 
explored.  Because,  as  of  this  writing,  only  six  independent  high  B/A  isolates  of 
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Prochlorococcus  exist  in  culture,  continued  efforts  should  be  made  to  obtain  new 
Prochlorococcus  isolates,  with  emphasis  on  high  B/A  strains.  To  this  end,  particular 
attention  should  be  paid  to  trying  different  physical  methods  of  isolation  (i.e.  flow 
cytometric  sorting,  which  has  been  successful  for  five  isolates,  see  Chapter  2  and  Chapter 
4,  Tablel)  and  alternative  media  formulations  which  may  select  for  different  strain  types. 
Continued  isolation  efforts  are  also  warranted  for  Synechococcus,  although  this  study  did 
not  exhaustively  sample  the  physiological  diversity  already  present  in  culture  collections. 
Thus,  an  immediate  next  step  would  entail  obtaining  ITS  sequences  from  additional 
Synechococcus  isolates,  in  particular  high  PUB  non-motile  strains,  low  PUB  motile 
strains,  and  psychrophilic  strains.  It  is  also  of  interest  to  sequence  the  ITS  of  some  of  the 
strains  analyzed  using  rpoCl  in  order  to  be  able  to  compare  the  clusters  found  with  the 
two  loci  (Toledo  and  Palenik  1997,  Toledo,  et  al.  1999). 

The  phylogenetic  affiliation  of  clone  150m  -47  from  Chapter  5  remains  puzzling. 

Because  the  PCR  strategy  used  in  generating  the  clone  libraries  involved  a  primer  about 
halfway  through  the  23S  rDNA,  (the  ITS  was  sequenced  using  an  internal  primer) 
additional  23S  rDNA  sequence  (at  least  1200  bp)  could  be  obtained  from  this  clone. 
However,  this  might  be  of  limited  utility,  as  there  are  not  currently  a  large  number  of 
cyanobacterial  23S  rDNA  sequences  publicly  available  for  comparison  (i.e.  the  latest 
release  of  the  RDP  (Maidak,  et  al.  1999)  contains  only  one,  the  freshwater  Synechococcus 
strain  PCC6301).  Another  strategy  might  be  to  examine  the  ITS  sequences  of  marine 
Synechocystis  strains  WH8501  and  WH8502  (Waterbury  and  Rippka  1989)  to  see  if  they 
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show  a  greater  sequence  similarity  to  clone  1 50m-47  than  the  Prochlorococcus  and 
Synechococcus  sequences  do. 

Further  efforts  can  also  be  made  towards  assessing  diversity  in  natural  populations.  The 
data  presented  in  Chapter  5  was  the  beginning  of  connecting  information  about 
physiological  diversity  to  genetic  diversity  in  field  populations  of  marine  cyanobacteria. 
However,  three  PCR  clone  libraries  from  two  water  columns  from  one  ocean  is  probably 
not  a  thorough  sampling  of  the  existing  diversity  of  marine  cyanobacterial  populations.  If 
additional  libraries  are  constructed,  sampling  locations  should  be  chosen  with  care.  The 
observation  that  isolates  from  different  oceans  have  identical  ITS  sequences  suggests  that 
simple  geographic  location  is  not  the  most  important  component  to  consider  when 
selecting  sampling  sites.  Rather,  additional  samples  should  be  from  environments  with 
diverse  physical  conditions,  which  may  select  for  alternate  ecotypes.  For  example,  the 
Gulf  of  Aqaba  in  the  Red  Sea  undergoes  winter  mixing  so  deep  that  Prochlorococcus 
disappear  for  a  period  of  time  most  years  (Lindell  and  Post  1995).  It  would  be  interesting 
to  examine  the  genetic  makeup  of  the  last  Prochlorococcus  after  the  onset  of  mixing,  as 
well  as  the  first  populations  to  come  back  in  the  spring.  Similarly  late  summer  or  early 
fall  in  the  Sargasso  Sea,  when  waters  have  been  stratified  for  months,  would  provide  a 
different  water  column,  and  potentially  different  populations  than  those  presented  here 
(DuRand,  et  al.  2000). 
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However,  given  the  drawbacks  associated  with  examining  PCR  amplified  sequences,  the 
next  obvious  step  is  the  quantitative  assessment  of  the  ecotypes.  This  is  possible  using 
signature  sequences  specific  to  each  ecotype  as  probes  in  quantitative  or  in  situ 
hybridizations  (Giovannoni,  et  al.  1988,  DeLong,  et  al.  1989),  or  as  primers  in 
quantitative  PCR  (Lee,  et  al.  1996,  Tay,  et  al.  2000).  An  examination  of  a  consensus 
sequences  for  each  cluster  over  the  first  -150  bp  of  the  23S  rDNA  (Fig.  1)  reveals  that 
such  signature  sequences  can  be  designed.  For  example,  a  20-mer  from  positions  18-38 
could  be  used  to  develop  a  probe  specific  for  low  B/A  Prochlorococcus,  one  for  high  B/A 
cluster  I  Prochlorococcus,  one  for  high  B/A  cluster  IV  Prochlorococcus,  and  one  for  all 
four  clusters  of  marine  Synechococcus.  All  of  these  probes  have  at  least  one  mismatch 
from  other  sequences,  the  low  B/A  one  has  two. 

It  is  not  possible  to  find  potential  probes  for  all  of  the  combinations  of  clusters  in  this 
region  of  the  23S  rDNA.  For  example,  there  is  not  a  sequence  that  is  specific  for  all  of 
the  high  B/A  Prochlorococcus  to  the  exclusion  of  the  low  B/A  Prochlorococcus  and 
Synechococcus.  It  is  possible  that  such  a  sequence  could  be  found  by  sequencing  more  of 
the  23S  rDNA,  but  that  may  not  be  likely  given  the  diverging  branching  patterns  of  the 
high  B/A  isolates.  However,  by  using  a  judicious  combination  of  probes  it  should  be 
possible  to  quantitate  the  clusters  of  interest.  Not  only  will  this  enable  the  testing  of 
current  hypotheses  about  the  distribution  of  the  high  and  low  B/A  ecotypes  with  depth, 
but  data  on  the  distribution  of  the  ecotypes  would  undoubtedly  generate  new  hypotheses 
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Figure  1.  Consensus  of  partial  23S  rDNA  sequences  of  Prochlorococcus  and 
Synechococcus  ecotypes.  Sequences  from  cultures  and  environmental  PCR  clone 
libraries  (Chapters  4  and  5)  were  used  to  determine  a  strict  consensus  sequence  for  each 
cluster.  Dots  at  a  base  position  indicate  an  identical  base  to  that  of  low  B/A  cluster  I. 
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that  could  be  explored  with  laboratory  cultures,  and  might  provide  hints  about  the 
physiology  of  “types”  known  only  from  cloned  sequences. 

Finally,  the  complete  genome  sequence  of  the  low  B/A  Prochlorococcus  isolate  MED4 
will  be  available  by  the  end  of  the  year  (for  the  status  of  the  project  to  date  see 
http://bbrp.llnl.gov/jgi/microbial/prochlorococcus_homepage.html).  This  will  provide  a 
wealth  of  new  information  and  likely  drive  Prochlorococcus  research  in  new  directions. 
Possible  benefits  from  the  genome  sequence  include  a  better  understanding  of  the 
photosynthetic  apparatus,  of  nutrient  acquisition  or  metal  detoxifying  mechanisms,  and  of 
the  extent  of  horizontal  gene  transfer.  Further,  due  to  the  rapid  advances  and  dropping 
costs  associated  with  sequencing  technology  it  is  likely  that  complete  genome  sequences 
for  a  high  B/A  Prochlorococcus  and  a  marine  Synechococcus  strain  will  be  determined  in 
the  next  few  years.  The  comparative  genomics  approach  could  be  very  powerful  in 
understanding  the  adaptations  of  each  ecotype,  and  perhaps  the  costs  associated  with 
those  adaptations. 
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1.  Introduction 

Prochlorococcus  is  responsible  for  a  significant  fraction  of  the  photosynthesis  in  the 
open  ocean.  This  marine  prokaryote  often  thrives  in  deep  waters  where  irradiance  levels 
are  0.1%  to  1%  of  surface  values.  The  ability  of  Prochlorococcus  to  flourish  in  low  light 
environments  may  be  due  to  unique  functional  and  structural  characteristics  of  its 
photosynthetic  apparatus.  Molecular  phylogenies  based  on  16S  rRNA  gene  analyses 
indicate  that  Prochlorococcus  is  a  member  of  the  cyanobacterial  lineage  (1).  However, 
unlike  its  cyanobacterial  counterparts,  Prochlorococcus  has  a  chlorophyll  a2/b2  light¬ 
harvesting  complex  (2),  and  evidence  has  not  been  obtained  for  the  presence  of  functional 
phycobili somes.  Hess  et  al.  (3)  have  reported  that  genes  encoding  the  a-  and  (3-subunits 
of  phycoerythrin  (PE)  are  present  in  two  (SSI 20,  TATL2)  of  the  six  Prochlorococcus 
isolates  which  they  examined.  Both  SS120  (120  m.  Sargasso  Sea)  and  TATL2  (30  m. 
Tropical  Atlantic)  were  isolated  from  the  Atlantic.  Three  other  isolates  were  also  from 
the  Atlantic  (10-30  m),  and  one  was  from  the  Mediterranean  (5  m)  (3).  Although  the  PE 
genes  are  expressed  at  low  levels  in  SS120  and  the  gene  products  appear  to  bind 
chromophores  (3),  the  function  of  PE  in  this  organism  remains  unknown.  Furthermore,  it 
has  not  been  determined  if  the  unique  characteristics  of  the  SSI 20  PE  sequence  (3)  are 
conserved  between  Prochlorococcus  isolates. 

We  have  in  culture  another  Prochlorococcus  isolate,  MIT9303,  from  100  m  in  the 
Sargasso  Sea.  Among  all  of  the  cultured  isolates,  it  has  the  highest  degree  of  16S  rRNA 
sequence  similarity  to  marine  Synechococcus,  the  group  from  which  Prochlorococcus 
may  have  evolved  (1,4).  Using  Southern  blot  analyses  and/or  PCR  amplification,  we 
detected  the  a-  and  (3-PE  genes  in  MIT9303,  but  not  in  two  other  isolates  from  different 
sites.  The  predicted  amino  acid  sequences  of  PE  of  MIT9303  and  SS120  (3)  differ  in 
their  number  of  putative  chromophore  binding  sites  and  in  many  other  positions.  Our 
ultrastructural  studies  did  not  reveal  phycobilisome-like  structures  associated  with  the 
internal  membranes  of  MIT9303.  At  present,  the  physiological  role  of  PE  in 
Prochlorococcus  remains  to  be  established. 
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2.  Procedure 

Prochlorococcus  cells  were  grown  in  Sargasso  Sea  water  enriched  with  nutrients,  at 
21°C  on  a  14  hr  light  (10  |imol  photons  m'2  s'1)  /  10  hr  dark  cycle.  Cells  were  prepared 
for  electron  microscopy  as  described  in  (5).  For  our  Southern  blots,  the  SSI 20  PE  genes 
were  amplified,  labeled  with  fluorescein  (ECL  Random  Prime  Labeling  and  Detection 
Systems,  Amersham,  IL),  and  utilized  as  the  probe.  Genomic  DNA  was  purified  (6), 
digested  with  Hindlll,  separated  on  an  agarose  gel,  and  transferred  to  nitrocellulose. 
These  membranes  were  hybridized  with  the  PE  gene  probe,  blocked,  incubated  with  anti- 
fluorescein-HRP  conjugate  and  detection  solutions,  and  exposed  to  autoradiography  film. 
For  the  characterization  of  PE  genes,  genomic  DNA  was  isolated  from  MIT9303,  and 
genes  encoding  the  p~  and  a-subunits  of  PE  were  amplified  using  primers  specific  for 
cpeB  (forward  primer:  ATGCTTGATGCATTCTCAAG)  and  cpeA  (reverse  primer: 
AAGGCATTAATAAGGTAATC)  of  SS120  (3).  PCR  products  were  separated  on  an 
agarose  gel,  purified,  and  cloned  using  the  pCR-Script  kit  (Stratagene,  CA).  Plasmid 
DNA  from  several  independent  clones  was  sequenced  on  an  automated  sequencer 
(LICOR,  NE).  Sequences  were  aligned  using  the  Genetic  Data  Environment. 

3.  Results  and  Discussion 

Genes  encoding  PE  were  detected  previously  in  two  out  of  six  Prochlorococcus 
isolates  examined  (3),  and  Hess  et  al.  (1996)  suggested  that  during  evolution,  the 
development  of  a  chlorophyll  a/b  light-harvesting  complex  in  Prochlorococcus  led  to  the 
disappearance  of  phycobilisomes  in  some  lines  (3).  We  used  Southern  blot  analyses  to 
confirm  that  cpeA  and  cpeB  are  absent  from  MED4,  as  previously  reported  (3),  and  are 
also  absent  from  NATL2A  and  MIT9313  (data  not  shown).  These  latter  isolates  were 
originally  obtained  from  the  Atlantic,  with  NATL2A  originating  from  10  m  (North 
Atlantic)  and  MIT9313  from  135  m  (Gulf  Stream).  However,  cpeA  and  cpeB  are  present 
in  MIT9303.  Using  primers  specific  for  the  SSI 20  PE  genes  and  the  PCR  technique,  we 
amplified  a  region  of  about  1000  base  pairs  from  the  genomic  DNA  of  M1T9303. 
Sequencing  confirmed  the  presence  of  cpeA  and  cpeB  on  this  fragment. 

The  MIT9303  cpeA  gene  is  similar  to  cpeA  of  SS120  (3),  and  does  not  contain 
additional  codons  or  deletions.  The  cpeA  genes  of  both  MIT9303  and  SS 120  are  slightly 
smaller  than  cpeA  of  marine  Synechococcus  sp.  strain  WH7803  (7)  because  of  two 
stretches  of  deletions  (12-15  nucleotides  each)  in  less-conserved  regions  of  the  gene. 
Although  the  cpeB  gene  of  MIT9303  is  also  similar  to  cpeB  of  SS120  (3),  it  is  missing  a 
codon  which  encodes  Asn-156  in  SS120.  In  addition,  both  the  MIT9303  and  SS120  cpeB 
genes  are  missing  two  codons  which  encode  Lys-157  and  Met- 158  in  WH7803  (7). 

The  predicted  amino  acid  sequences  of  the  P-subunits  of  MIT9303  (sequence  not 
shown)  and  SS120  (3)  share  approximately  73%  amino  acid  identity.  In  the  P-subunit, 
the  four  cysteines  (Cys-50,  -61,  -82,  -165),  which  are  important  in  chromophore  binding 
(7-10),  are  conserved  in  MIT9303.  In  addition,  residues  which  are  involved  in 
chromophore  interaction  (Arg-77,-78,  Ala-81,  Arg-84,  Asp-85)  (9)  are  also  conserved. 
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The  a-PE  subunits  of  MIT9303  and  SSI 20  share  approximately  65%  amino  acid 
identity  (Fig.  1).  An  important  difference  in  the  predicted  amino  acid  sequence  of  a-PE 
of  MIT9303  involves  the  presence  of  two  cysteine  residues  (Cys-73,  Cys-130),  which  are 
putative  chromophore  binding  sites  (Fig.  1).  The  a-PE  of  SS120  contains  only  Cys-  73, 
and  has  an  Asp  residue  substituted  for  Cys-130  (3).  While  Cys-73  is  homologous  to  a 
bilin  attachment  site  (Cys-88)  that  is  conserved  in  almost  all  phycobiliproteins  (9),  Cys- 
130  is  homologous  to  a  chromophore  attachment  site  at  Cys- 140  in  organisms  including 
Synechococcus  sp.  strains  WH7803  (Class  I  PE)  (7),  and  WH8103  and  WH8020  (Class  H 
PE)  (8).  The  PE  (a+{3  subunits)  of  MIT9303  has  a  total  of  five  putative  chromophore 
attachment  sites,  a  characteristic  associated  with  Class  I  PE  (7-10).  In  the  a-PE  of  both 
MIT9303  and  SS120  (3),  Arg-75  and  Asp-76,  which  are  highly  conserved  residues 
involved  in  maintaining  proper  chromophore  conformation  (9),  are  also  present. 
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Figure  1. 

Predicted 

amino  acid 

sequences  of  a-PE  of  MIT9303  and  SSI 20  (3). 

The  last 

eight  amino  acid  residues  of  the  MIT9303  a-PE  sequence  have  not  been  shown  because 
their  corresponding  nucleotides  were  used  as  the  primer  in  the  original  amplification  of 
the  PE  gene  in  this  isolate.  (*)  indicate  conserved  Cys  that  are  putative  chromophore 
attachment  sites.  (•)  represent  functionally  important  residues  discussed  in  the  text. 

In  order  to  determine  whether  the  intracytoplasmic  membranes  of  M1T9303  are 
associated  with  phycobilisome-like  structures,  we  examined  the  ultrastructure  of  this 
isolate.  In  MIT9303,  the  intracytoplasmic  lamellae  are  organized  into  parallel  bands, 
which  extend  the  length  of  the  cell  and/or  form  concentric  circles  near  the  periphery  of 
the  cytoplasmic  membrane  (Figs.  2A,  2B).  The  lamellae  in  these  preparations  were 
tightly  appressed  and  lacked  phycobilisome-like  structures  (Figs.  2A,  2B).  This 
organization  and  the  overall  ultrastructure  of  MIT9303,  are  similar  to  what  we  have 
observed  for  other  Prochlorococcus  isolates  (5,1 1). 
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Figures  2A  and  2B.  Transmission  electron  micrographs  of  thin  sections  of  MIT9303.  A.) 
Longitudinal  section  B.)  Cross-section 

The  possession  of  cpeA  and  cpeB  is  a  characteristic  that  is  not  conserved  among 
Prochlorococcus  isolates.  An  understanding  of  the  driving  forces  which  may  have 
resulted  in  the  acquisition  and/or  loss  of  these  genes  is  linked  intimately  with  further 
studies  on  the  function  of  PE  in  Prochlorococcus.  These  studies  will  also  allow  us  to 
determine  whether  the  presence  of  cpeA  and  cpeB  in  M1T9303  and  SSI 20  confers  a 
biological  advantage  under  specific  light  and/or  nutrient  conditions. 
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Since  the  publication  of  Chapters  2  and  3  of  this  thesis  (Moore,  et  al.  1998,  Rocap,  et  al. 
1999),  additional  marine  cyanobacterial  16S  rRNA  genes  have  been  sequenced,  both 
from  cultures  (D.  Distel  and  J.  Waterbury,  unpublished  data)  and  from  the  environment 
(West  and  Scanlan  1999).  Further,  sequences  from  additional  freshwater  Synechococcus 
suggest  that  the  Synechococcus  group  as  a  whole  is  paraphyletic  within  the 
cyanobacterial  radiation,  and  that  the  strain  Synechococcus  PCC6301,  which  was  used  as 
the  outgroup  in  Chapters  2  and  3,  is  in  a  different  lineage  than  the  marine  picoplankton 
clade  (Honda,  et  al.  1999). 

Here  I  present  a  phylogenetic  analysis  of  the  Prochlorococcus  16S  rDNA  sequences  from 
Chapters  2  and  3  with  the  new  sequences  now  available.  The  trees  are  rooted  with  the 
freshwater  cyanobacterial  strain  PCC6307  since  it  is  more  closely  related  to  the  marine 
picoplankton  clade  than  Synechococcus  PCC6301  (Honda,  et  al.  1999).  I  also  present  an 
extended  version  of  the  methods  used  to  acquire  the  sequences  in  Chapters  2  and  3, 
which  were  not  elaborated  in  those  chapters,  because  of  the  space  constraints  imposed  by 
the  journal  in  Chapter  2  and  because  Chapter  3  was  written  to  review  the  state  of 
knowledge  about  the  ecotypes  of  Prochlorococcus  at  the  end  of  1998. 

METHODS 

Culturing  and  DNA  isolation.  Cultures  were  grown  in  60  ml  acid  washed 
polycarbonate  bottles  in  Pro2  media  (see  Chapter  4).  Genomic  DNA  was  isolated  from 
late  exponential  phase  cultures  using  a  protocol  adapted  from  standard  methods  (Ausubel, 
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et  al.  1992).  Cells  were  spun  in  an  ultracentrifuge  for  20  min  at  10,000  rpm  at  4°  C.  The 
pellet  was  transferred  to  a  1.5  ml  tube  and  spun  in  a  microcentrifuge  for  10  min  at  max 
speed.  The  supernatant  was  removed,  the  pellet  was  resuspended  in  430  pi  TE  and  17  pi 
50  mg/ml  lysozyme  (final  concentration  2  mg/ml)  was  added.  The  sample  was  incubated 
at  37°  C  for  30  min  and  30  pi  10%  SDS,  60  pi  5%  Sarcosyl  and  60  pi  5%  TritonX-100 
were  added  (all  final  concentration  0.5%).  The  sample  was  incubated  at  37°  C  for  1  hour 
and  100  pi  5M  NaCl  and  80  pi  CTAB/NaCl  (10%  CTAB,  0.7  M  NaCl)  were  added  and 
the  sample  was  incubated  in  a  65°  C  water  bath  for  10  min.  The  sample  was  extracted 
with  an  equal  volume  of  24: 1  chloroform:isoamyl  alcohol,  then  with  an  equal  volume  of 
25:24:1  phenol:chloroform:isoamyl  alcohol,  and  then  repeatedly  (usually  twice  more) 
with  24:1  chloroform:  isoamyl  alcohol  until  no  interface  was  visible  between  the  aqueous 
and  the  organic  phases.  DNA  was  precipitated  with  0.6  volumes  isopropyl  alcohol  and 
stored  at  4°  or  -20°  overnight.  The  next  day  the  sample  was  spun  for  lOmin  at  4°  C  in  a 
microfuge  at  max  speed,  the  pellet  was  washed  twice  in  70%  ethanol  and  dried  on  the 
bench  or  in  a  speed  vacuum  (using  no  heat  setting),  and  resuspended  in  30-100  pi  TE. 

For  all  samples  a  no-cell  blank  was  carried  through  the  extraction  procedure  in  parallel 
(starting  with  430  pi  TE)  and  used  as  a  control  in  PCR  reactions. 

Amplification  and  cloning  of  16S  rDNA.  The  16S  rDNA  was  amplified  using  universal 
primers  8-27f  (AGAGTTTGATCCTGGCTCAG)  and  1504-1486r 
(CTTGTTACGACTTCACCCC).  Reactions  were  done  in  a  25  pi  volume  with  final 
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concentrations  of  reactants  as  follows:  0.25  mM  dNTPs,  0.5  mM  each  primer,  01. -1  jig 
template  DNA,  IX  Pfu  Buffer  and  0. 1-0.5  U  of  the  high  fidelity  polymerase  Pfu 
(Stratagene).  Cycling  parameters  were  94°  for  5  min  followed  by  30  cycles  of  (92°  lmin, 
60°  lmin,  72°  2  min)  and  a  final  extension  at  72°  for  10  min  in  a  PTC- 100  PCR  machine 
(MJ  Research).  PCR  reactions  were  performed  in  quintuplicate  and  pooled  and  purified 
using  QIA  quick  kits  (Qiagen). 


PCR  products  were  cloned  using  pCR-Script  kits  (Stratagene).  Plasmids  were  prepared 
from  clones  using  a  standard  alkaline  lysis  minprep  method  (Ausubel,  et  al.  1992)  and 
screened  by  digestion  with  Pvu  II,  which  cuts  the  vector  pCR-Script  twice,  once  on  either 
side  of  the  insertion  site,  and  does  not  cut  Prochlorococcus  16S  rDNA  sequences. 
Plasmids  with  appropriately  sized  inserts  were  prepared  for  sequencing  using  a  high 
purity  miniprep  involving  a  phenol/chloroform  extraction  and  a  PEG/NaCl  precipitation. 

Sequencing.  All  sequencing  reactions  were  performed  using  Epicentre  Technologies 
Long-Read  cycling  sequencing  kits  (Sequi-Therm,  Sequi-Therm  Excel,  or  Sequi-Therm 
Excel  II)  according  to  the  manufacturer’s  instructions  using  dye  labeled  primers,  and  run 
on  a  LI-COR  Model  4000  automated  sequencer  (LI-COR,  Lincoln,  NE). 

For  the  sequences  in  Chapter  2  (MIT9302,  MIT9303,  MIT9312,  MIT9313),  at  least  eight 
clones  from  each  culture  were  sequenced  independently  on  one  strand  using  primers 
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M13F  and  M13R  and  plasmid  DNA  as  the  template.  For  second  strand  sequencing 
plasmids  were  pooled  in  equimolar  amounts  and  sequenced  using  internal  primers. 

For  the  sequences  in  Chapter  3  (MIT9201,  MIT9202,  MIT921 1,  MIT9215),  at  least  three 
clones  per  culture  were  sequenced  independently  on  one  strand  using  primers  M13F  and 
M13R  and  plasmid  DNA  as  the  template.  For  second  strand  sequencing  the  16S  rDNA 
was  amplified  from  genomic  DNA  as  described  above  and  sequenced  using  internal 
primers. 

Phylogenetic  analyses.  Sequences  were  manually  edited  and  aligned  using  the  Genetic 
Data  Environment  (Smith,  et  al.  1994).  Phylogenetic  analyses  were  performed  with 
PAUP*  v.  4b2a  (Swofford  1999)  and  employed  1076  unambiguously  aligned  and 
determined  positions  of  the  16S  rDNA.  Distance  trees  were  inferred  using  the  HKY85 
model  of  nucleotide  substitution.  Heuristic  searches  and  bootstrap  analyses  (100 
resamplings)  utilized  random  addition  and  tree-bisection  reconnection  (TBR)  branch 
swapping  methods.  Phylogenetic  trees  were  visualized  with  TREEVIEW  (Page  1996). 

Nucleotide  sequence  accession  numbers.  Sequences  from  Chapter  3  have  been 
deposited  in  GenBank  with  the  following  accession  numbers;  MET9201:  AF1 15268 
MIT9202:  AF1 15269  MIT921 1:  AF1 15270  and  MIT9215:  AF1 15271. 
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RESULTS 


Phylogenetic  analyses  employing  1076  bp  of  the  16S  rDNA  and  including  sequences 
from  chapter  2  and  3  as  well  as  sequences  from  marine  Synechococcus  WH8012  and 
WH81 12  (unpublished  data  courtesy  of  D.  Distel  and  J.  Waterbury)  and  environmental 
sequences  from  the  North  Atlantic  (West  and  Scanlan  1999)  demonstrate  many  of  the 
features  identified  in  chapter  4  using  the  more  variable  ITS.  The  division  of  the  low  B/A 
Prochlorococcus  clade  into  two  clusters  is  well  supported,  as  are  the  high  B/A  clusters  I 
and  IV  (Fig.  1).  The  motile  Synechococcus  isolate  WH81 12  is  closely  related  to  motile 
isolate  WH8103,  consistent  with  results  from  the  ITS.  In  contrast  Synechococcus 
WH8012  is  not  specifically  associated  with  the  other  Synechococcus  in  this  analysis, 
consistent  with  results  from  the  ITS  placing  WH8012  in  a  separate  cluster  from  WH7805 
and  WH8103  (Chapter  4). 

As  described  previously  (West  and  Scanlan  1999),  the  environmental  sequences  from  the 
North  Atlantic  fall  into  two  groups,  one  of  which  is  closely  related  to  MED4  (low  B/A 
cluster  I)  and  the  other  is  closely  related  to  NATL2A  (high  B/A  cluster  I)  (Fig.  1). 
Environmental  sequences  SAR  100  and  SARI 39  group  with  the  motile  Synechococcus 
isolates  in  cluster  II.  The  position  of  environmental  sequence  SAR7  remains  uncertain, 
as  bootstrap  analyses  do  not  provide  consistent  support  for  its  placement  in  either  the 
Prochlorococcus  or  the  Synechococcus  clade. 
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Figure  1.  Evolutionary  relationships  of  Prochlorococcus  and  Synechococcus  16S 
rDNA  sequences  and  environmental  sequences  from  the  Sargasso  Sea  (SAR)  and  the 
North  Atlantic  (ENATL).  Phylogenetic  framework  inferred  using  a  distance  criterion 
using  minimum  evolution  as  the  objective  function.  Bootstrap  values  (from  100 
resamplings)  are  presented  to  the  left  of  each  node,  values  below  50  are  not  shown. 
The  tree  is  rooted  with  Cyanobium  cluster  Synechococcus  PCC6307. 
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Appendix  III 


Secondary  structures  of  sequences  in  chapters  2 
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One  of  the  features  of  ribosomal  RNA  that  makes  it  amenable  to  phylogenetic  analysis  is 
its  conserved  secondary  structure  which  permits  alignment  of  sequences  across  large 
evolutionary  distances  (Olsen  and  Woese  1993).  In  order  to  check  that  base  pair  changes 
between  strains  of  Prochlorococcus  would  retain  a  conserved  secondary  structure 
(Gutell,  et  al.  1994),  and  to  have  a  better  understanding  of  potential  primer  and  probe 
design  sites,  putative  secondary  structures  were  plotted  for  the  16S  rDNA  sequences 
obtained  in  chapter  2  (Fig.  1,  A,  B,  C,  D).  The  overall  structure  was  derived  from  that  for 
Synechococcus  PCC6301  (Gutell  1994)  and  the  Prochlorococcus  sequences  were  plotted 
on  this  framework  using  the  program  CARD  (Winnepenninckx,  et  al.  1995). 
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Prochlorococcus  MIT9313  SSU  rRNA 


,*c\ 


AAAUGC  aucggga  aqaa^  aguggc®a 
GuQGCGa  gU6GCCCUUAAG^3.  qG  CGqA 


'CCACG  UUUUUAGUGCC  CAUUq 

*"  *.**!’’  „a 

Q  Q. 


U.  QUGCUCG^  ^GGUGU  a^agaucacgg  SUUq. 
g  *  c  '•-«  nQ  Q*  uc 

w 


u 


AACUCCAU  ggcagugga  cugauuggc 

JUcGAGGUG  uugucgucu  gacuuuuc%* 

'gaa*  *  c 


d‘AAa.°v= 


152 


Document  Library 

Distribution  List  for  Technical  Report  Exchange — November  1999 


University  of  California,  San  Diego 
SIO  Library  01 75C 
9500  Gilman  Drive 
La  Jolla,  CA  92093-0175 

Hancock  Library  of  Biology  &  Oceanography 

Alan  Hancock  Laboratory 

University  of  Southern  California 

University  Park 

Los  Angeles,  CA  90089-0371 

Gifts  &  Exchanges 
Library 

Bedford  Institute  of  Oceanography 
P.O.Box  1006 
Dartmouth,  NS  B2Y  4  A2 
CANADA 

NOAA/EDIS  Miami  Library  Center 
4301  Rickenbacker  Causeway 
Miami,  FL  33149 

Research  Library 
U.S.  Army  Corps  of  Engineers 
Waterways  Experiment  Station 
3909  Halls  Ferry  Road 
Vicksburg,  MS  39180-6199 

Institute  of  Geophysics 
University  of  Hawaii 
Library  Room  252 
2525  Correa  Road 
Honolulu,  HI  96822 

Marine  Resources  Information  Center 

Building  E38-320 

MIT 

Cambridge,  MA  02139 
Library 

Lamont-Doherty  Geological  Observatory 
Columbia  University 
Palisades,  NY  10964 

Library 

Serials  Department 
Oregon  State  University 
Corvallis,  OR  97331 

Pell  Marine  Science  Library 
University  of  Rhode  Island 
Narragansett  Bay  Campus 
Narragansett,  Rl  02882 


Working  Collection 
Texas  A&M  University 
Dept,  of  Oceanography 
College  Station,  TX  77843 

Fisheries-Oceanography  Library 
151  Oceanography  Teaching  Bldg. 
University  of  Washington 
Seattle,  WA  98195 

Library 

R.S.M.A.S. 

University  of  Miami 

4600  Rickenbacker  Causeway 

Miami,  FL  33149 

Maury  Oceanographic  Library 
Naval  Oceanographic  Office 
Building  1003  South 
1 002  Balch  Blvd. 

Stennis  Space  Center,  MS  39522-5001 
Library 

Institute  of  Ocean  Sciences 
P.O.  Box  6000 
Sidney,  B.C.  V8L  4B2 
CANADA 

National  Oceanographic  Library 
Southampton  Oceanography  Centre 
European  Way 
Southampton  SOM  3ZH 
UK 

The  Librarian 

CSIRO  Marine  Laboratories 
G.P.O.  Box  1538 
Hobart,  Tasmania 
AUSTRALIA  7001 

Library 

Proudman  Oceanographic  Laboratory 

Bidston  Observatory 

Birkenhead 

Merseyside  L43  7  RA 

UK 

IFREMER 
Centre  de  Brest 

Service  Documentation-Publications 
BP  70  29280  PLOUZANE 
FRANCE 


50272-101 


REPORT  DOCUMENTATION  1  •  REPORT  NO. 

PAGE  MIT/WHOI  00-23 _ 


4.  Title  and  Subtitle 

Genetic  Diversity  and  Ecotypic  Differentiation  in  the  Marine  Cyanobacteria 
Prochlorococcus  and  Synechococcus 


7.  Authors)  . 

Gabrielle  Rocap 


9.  Performing  Organization  Name  and  Address 

MIT/WHOI  Joint  Program  in  Oceanography/Applied  Ocean  Science  &  Engineering 


3.  Recipient's  Accession  No. 


5.  Report  Date 

February  2000 


6. 


8.  Performing  Organization  Rept.  No. 


10.  Project/Task/Work  Unit  No. 

MIT/WHOI  00-23 


11.  Contract(C)  or  Grant(G)  No. 

(O  NAG5-3727 
_  OCE-9820035 


12.  Sponsoring  Organization  Name  and  Address 

National  Science  Foundation 

National  Aeronautics  and  Space  Administration 


13.  Type  of  Report  &  Period  Covered 

Ph.D.  Thesis 


15.  Supplementary  Notes 

This  thesis  should  be  cited  as:  Gabrielle  Rocap,  2000.  Genetic  Diversity  and  Ecotypic  Differentiation  in  the  Marine 
Cyanobacteria  Prochlorococcus  and  Synechococcus.  Ph.D.  Thesis.  MIT/WHOI,  00-23. 


16.  Abstract  (Limit:  200  words) 

Primary  production  in  the  open  ocean  is  dominated  by  the  closely  related  cyanobacterial  genera  Prochlorococcus  and 
Synechococcus.  This  thesis  explores  the  relationship  between  physiological  and  genetic  diversity  in  cultured  isolates  of 
Prochlorococcus  and  Synechococcus  and  examines  genetic  diversity  in  natural  cyanobacterial  populations  to  better 
understand  the  ecology  of  this  globally  important  clade. 

The  analysis  of  16S  ribosomal  RNA  gene  (rDNA)  sequences  of  a  spectrum  of  Prochlorococcus  isolates  from  the  world's 
oceans  suggested  that  Prochlorococcus  could  be  divided  into  two  distinct  ecotypes,  designated  high  and  low  B/A  because 
of  their  differing  chi  b/a2  ratios.  To  further  resolve  the  ecotypes,  the  internal  transcribed  spacer  region  (ITS)  between  the  16 
and  23S  rRNAs  was  sequenced  in  isolates  of  Prochlorococcus  and  Synechococcus.  Phylogenetic  analysis  identified  ten 
sequence  clusters,  four  of  Synechococcus  and  six  of  Prochlorococcus,  each  of  which  likely  corresponds  to  an  ecologically 
distinct  population. 

Genetic  diversity  of  natural  populations  of  Prochlorococcus  and  Synechococcus  was  investigated  in  summer  and  winter 
waters  of  the  Sargasso  Sea  by  constructing  clone  libraries  from  PCR  amplified  ITS  sequences,  which  were  screened  by 
restriction  fragment  length  polymorphism  (RFLP).  Based  on  their  phylogenetic  relationships  the  sequences  were  assigned 
to  eight  clusters,  seven  of  which  had  been  previously  identified. 


17.  Document  Analysis  a.  Descriptors 
Cyanobacteria 
Prochlorococcus 
Evolution 

b.  Identifiers/Open-Ended  Terms 


c.  COSATI  Reid/Group 


18.  Availability  Statement 

Approved  for  publication;  distribution  unlimited. 


(See  ANSI-Z39.18) 


19.  Security  Class  (This  Report) 

UNCLASSIFIED 

21.  No.  of  Pages 

152 

20.  Security  Class  (This  Page) 

22.  Price 

See  Instructions  on  Reverse 


OPTIONAL  FORM  272  (4-77) 

(Formerly  NTIS-35) 
Department  of  Commerce 


